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Abstract 


A Cray-1  logic-timing  simulator  is  described.  This  simu- 
lator is  written  in  Fortran-IV  on  an  IBM  360/370  series  machine. 

The  simulator  provides  extensive  reporting  of  Cray-1  resource 
usage  and  resource  conflicts.  By  calling  the  simulator  as  a sub- 
routine, the  user  may  flexibly  use  Cray-1  program  simulation  within 
a larger  problem  environment.  Extensive  interactive  debugging 
features  make  the  Cray-1  simulator  a powerful  tool  for  the  develop- 
ment of  Cray-1  assembly  language  programs. 
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Preface 


The  simulator  described  in  this  report  was  developed  to  support 
general  vector  algorithm  studies  and  to  prepare  and  evaluate  per- 
formance of  3-dimensional  aerodynamic  fluid  flow  codes  on  a vector 
processor.  It  was  felt  to  be  of  sufficient  general  interest  and 
utility  that  this  documentation  was  prepared. 

The  simulator  will  accept  machine  code  from  either  of  two  cross 
assemblers:  (1)  a CDC  6000/7000  series  version  developed  at  the 

National  Center  for  Atmospheric  Research,  and  (2)  an  IBM  360/370 
series  version  developed  at  the  University  of  Michigan  and  described 
in  companion  SEL  Report  #120. 
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1. 


Introduction 


1 . 1 Forward 

The  University  of  Michigan  Cray-1  simulator  was  written  during 
1977-78.  The  decision  to  build  a simulator  was  motivated  by  the 
following  considerations: 

(1)  Access  to  a Cray-1  for  the  purposes  of  algorithm  design 
and  code  development  was  often  very  difficult  and  access  on  any 
continuing  basis  for  research  purposes  was  not  possible.  For  most 
research  purposes  simulation  will  provide  satisfactory  performance. 

(2)  Even  with  access  to  a Cray-1,  it  is  often  quite  difficult 
to  analyze  algorithm  performance.  This  is  because  there  is  no  hard- 
ware instrumentation  on  the  Cray-1  to  permit  a study  of  CPU  resource 
usage  and  conflict.  The  U of  M Cray-1  simulator  can  provide  a de- 
tailed report  of  CPU  activity. 

(3)  With  simulation  it  is  possible  to  study  the  impact  of 
Cray-1  architectural  modifications  on  algorithm  performance. 

(4)  For  algorithms  which  must  be  carefully  designed  and  coded, 
the  programmer  can  use  the  simulator  to  analyze  instruction  delays 
and  re-order  instructions  as  necessary  to  minimize  conflicts. 

(5)  When  debugging  programs,  it  is  useful  to  have  interactive 
control  of  program  execution.  Through  the  use  of  break-points,  at- 
points  and  command  files,  the  simulator  lends  considerable  flexi- 
bility to  the  debugging  process. 

Experience  with  the  Cray-1  simulator  has  shown  it  to  be  a 
very  useful  and  flexible  algorithm  design  and  code  development 
tool.  The  simulator  was  designed  for  use  on  the  Michigan  Terminal 
System  (MTS) . MTS  is  an  interactive  time-sharing  system  developed 
at  the  U of  M and  is  presently  running  on  an  Amdahl  470/V6  computer. 
This  machine  is  compatible  with  an  IBM  370/168.  As  a result,  the 
U of  M Cray-1  simulator  is  mildly  MTS  dependent  and  heavily  IBM 
370  dependent.  The  MTS  dependencies  are  related  to  I/O  usage  and 
the  370  dependencies  are  related  to  byte  addressing  and  the  32  bit 
word  of  the  370  architecture.  See  appendix  G.  for  program  availa- 
bility information. 
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1 . 2 Summary 


We  have  used  the  Cray-1  simulator  in  some  substantial  code 
development  work  that  has  shaken  out  a number  of  problems.  However, 
some  incompatibilities  with  the  Cray-1  hardware  do  exist  and  are 
noted  in  section  2.6. 

The  timing  accuracy  of  the  simulator  is  fairly  good.  We  have 
observed  a timing  error  on  the  order  of  ± 1/2%  with  programs 
that  have  been  run  on  the  Cray-1.  We  have  also  timed  sixty  six 
short  instruction  test  segments,  all  of  which  time  correctly.  With- 
out convenient  access  to  a Cray-1,  we  f ave  been  unable  to  further 
study  the  timing  error. 

The  Cray-1  simulator  currently  provides  two  kinds  of  reporting: 

(1)  A summary  report  that  includes  vector  unit  busy  times, 
floating  point  operation  counts,  average  vector  length 
calculation  and  data  flow  accounting. 

(2)  A resource  activity  report  which  permits  a study  of 
resource  conflicts  at  each  clock  period  of  simulation. 

For  more  information  on  reporting,  see  section  2.5. 

The  Cray-1  simulator  can  be  flexibly  integrated  within  a large 
problem  environment  through  the  following  two  features; 

(1)  The  simulator  may  be  called  as  subroutine.  This  permits 
the  user  to  embed  simulation  within  a larger  program, 
allowing  kernels  or  special  functions  to  be  simulated  while 
leaving  the  rest  of  the  program  in  Fortran. 

(2)  The  simulator  may  call  user  defined  subroutines  to  perform 
special  functions  (eg.  I/O,  SQRT  etc.)  that  may  be  already 
available  on  the  Cray-1. 

See  section  3 for  an  example  of  this. 

As  a front-end  to  the  Cray-1  simulator,  the  command  language 
provides  the  user  interactive  control  over  the  simulation.  The  user 
may  set  break  points  and  display  or  alter  the  registers  and  memory 
of  the  simulated  Cray-1.  As  implemented  at  the  University  of  Michigan, 
the  simulator  detects  floating  point  operation  exceptions,  operand 
reference  errors  and  branch  errors  and  returns  control  to  the  user 
for  investigation. 
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The  Cray-1  simulator  also  provides  a means  for  studying 
architectural  modifications  to  the  Cray-1.  One  option  currently 
implemented  is  the  ability  to  specify  an  improved  memory  bandwidth. 
With  this  option  enabled,  vector  loads  and  stores  move  data  to  and 
from  main  memory  at  the  fastest  data  rate  possible  consistent  with 
the  address  increment.  Depending  on  the  increment,  the  data  rate 
ranges  between  four  words  per  clock  period  and  1/4  word  per  clock 
period.  As  algorithm  study  on  the  Cray-1  develops,  the  simulator 
can  be  used  to  assess  the  impact  of  Cray-1  architectural  changes 
on  algorithm  performance. 
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2. 


Simulator  Features 


The  U of  M Cray-1  simulator  was  designed  to  time,  as  accurately 
as  possible,  the  instruction  resource  usage  of  the  Cray-1  vector 
processor.  This  section  of  the  user  manual  has  been  divided  into 
six  sub-sections,  each  devoted  to  a particular  aspect  of  the  Cray-1 
simulator.  No  attempt  has  been  made  to  describe  the  architecture 
of  the  Cray-1  itself.  The  bibliography  lists  several  sources  for 
this  information. 

The  following  is  an  overview  of  the  material  covered  in  this 
section : 

(1)  Sub-section  2.1  includes  an  introduction  to  the  simulator 
command  language  and  the  running  of  simulated  programs. 

(2)  Sub-section  2.2  covers  the  exceptional  conditions  that 
may  arise  when  using  the  simulator.  For  example,  kevboard  atten- 
tion interrupts,  arithmetic  overflows,  invalid  simulator  memory 
references,  etc. 

(3)  Sub-section  2.3  covers  the  subroutine  interface  through 
which  a Fortran  program  may  call  the  Cray-1  simulator.  This 

is  useful  for  simulating  only  a portion  of  a program,  while  retaining 
the  rest  of  it  in  Fortran-IV  for  either  cost  or  convenience  reasons. 

(4)  Sub-section  2.4  covers  the  simulator  exit  processing. 

Through  the  Cray-1  Exit  instruction  the  user  may  have  the  simulated 
program  call  a user  provided  subroutine  to  perform  functions  that 
might  be  provided  by  the  operating  system  or  the  subroutine  libraries 
in  an  actual  Cray-1  environment. 

(5)  Sub-section  2.5  covers  the  report  generation  facilities 
of  the  simulator.  This  reporting  is  controlled  by  the  CPACT  and 
STAT  commands. 

(6)  Sub-section  2.6  covers  inconsistencies  between  the  simu- 
lator and  the  Cray-1  computer  that  are  presently  known.  Un implemented 
instructions  are  discussed  here  along  with  other  minor  inconsistencies 
such  as  data  formats,  timing  inaccuracies,  etc. 
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2 . 1 Command  Language 

The  command  language  provides  the  user  interface  to  the  Cray-1 
simulator.  Through  the  commano  language,  the  user  controls  and 
monitors  the  progress  of  the  simulated  program.  The  user  has  con- 
siderable flexibility  in  controlling  input  to  and  output  from  the 
simulator.  This  section  is  organized  into  the  following  four  sub- 
sections; 

(1)  Command  language  input  control 

(2)  Command  language  output  control 

(3)  Simulator  control 

(4)  Running  programs  on  the  Cray-1  simulator 


2.1.1  Command  Language  Input  Control 

When  starting  up  the  simulator  in  stand-alone  mode  (by  the  MTS 
command  - $RUN  CRAYl),  the  simulator  will  prompt  for  terminal  input 
by  typing  a period.  The  user  may  then  enter  a command  or  redirect 
the  command  input  stream  to  read  from  a file  via  the  USE  command. 

The  filename  parameter  on  the  USE  command  directs  the  simulator  to 
open  that  file  and  begin  reading  commands.  Upon  an  end-of-file 
condition  the  input  stream  is  switched  back  to  the  terminal. 

More  than  one  USE  command  may  be  issued,  allowing  nested  command 
files  to  be  built  by  the  user.  The  simulator  command  language  main- 
tains a command  stream  input  stack  which  controls  the  issue  of  nested 
USE  coramands. 

The  command  stack  is  also  used  when  the  simulator  is  called  as 
a subroutine  (see  section  2.3).  For  subroutine  usage,  the  caller 
supplied  command  string  is  split  at  the  command  separator  character 
(a  semi-colon)  and  each  command  is  written  to  a scratch  file.  This 
scratch  file  is  termed  the  call-file.  The  call-file  is  terminated 
with  a RETURN  command,  so  that  after  execution  of  the  caller  commands 
automatic  return  is  made  from  the  simulator  to  the  caller.  After 
creating  the  call-file,  the  subroutine  interface  pushes  the  call-file 
onto  the  command  stack  causing  subsequent  commands  to  come  from  the 
call-file. 
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Another  use  of  the  command  stack  arises  from  the  use  of  AT 
points  that  may  be  set  by  the  user.  An  AT  point  is  similar  to  a 
break  point,  in  that  each  is  set  at  some  instruction  address  in 
the  user's  proqram.  Upon  hitting  a break  point,  program  simulation 
is  halted  and  control  reverts  to  the  terminal  allowing  the  user  to 
monitor  the  program's  behavior.  An  AT  point  differs,  in  that  when 
it  is  created  the  user  may  also  enter  one  or  more  simulator  commands 
that  will  be  automatically  executed  when  the  AT  point  is  hit.  These 
commands  are  saved  in  a scratch  file  and  then,  during  simulation 
when  the  AT  point  is  hit,  the  simulator  pushes  the  AT  point's  scratch 
file  onto  the  command  stack  causing  subsequent  commands  to  come  from 
the  AT  file.  A RUN  command  is  automatically  placed  at  the  end  of  the 
AT  file,  causing  simulation  to  resume  uninterrupted  after  the  AT  com- 
mands have  been  processed.  AT  commands  are  useful  for  automatically 
displaying  register  or  memory  locations  at  selected  points  in  a pro- 
gram. In  cases  where  the  user  wishes  to  display  various  locations 
and  then  regain  control  for  other  purposes,  entering  the  command 
USE  *MSOURCE*  will  switch  command  input  to  the  terminal  during  AT 
command  processing.  Any  end-of-file  condition  ($ENDFILE  or  control 
C)  will  terminate  input  from  the  top  entry  of  the  command  stack, 
causing  the  stack  to  be  popped  and  input  to  continue  from  the  pre- 
vious source.  In  the  case  of  an  AT  file  with  a 'USE  *MSOURCE*' 
command  in  it,  an  end-of-file  condition  from  the  terminal  will  resume 
simulation.  In  fact,  when  a break  point  is  hit,  the  simulator  auto- 
matically issues  an  implied  'USE  *MSOURCE*'  command  which  reverts 
control  to  the  terminal. 

The  command  stack  is  fifteen  levels  deep  with  the  base  entry 
preset  to  *MSOURCE*  which  can  never  be  popped.  Only  one  AT  or  BREAK 
point  can  be  hit  at  any  time,  therefore  a subsequent  RUN  command  will 
pop  the  command  stack  through  the  last  AT  or  BREAK  entry  on  the  stack. 
Upon  a RETURN  command  the  command  stack  will  be  popped  through  the 
last  call-file  entry  on  the  stack. 

Occasionally  due  to  an  error  condition  the  message  "Command 
Stack  Reset"  will  be  printed.  This  means  that  the  command  stack  has 
been  cleared  to  the  base  entry  which  is  preset  to  *MSOURCE*.  This 
assures  that  the  error  condition  will  return  input  control  to  the  user. 
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However,  this  means  that  any  commands  not  yet  executed  in  any  out- 
standing call-files,  AT  files  or  USE  files  have  been  lost. 

A keyboard  attention  interrupt  will  cause  the  command  stack  to 
be  reset.  This  is  useful  to  stop  a USE  file  or  prevent  subsequent 
commands  in  the  call-file  from  being  processed. 

2.1.2  Command  Language  Output  Control 

Normal  output  from  the  simulator  (informational  messages,  DIS- 
PLAY output,  etc.)  can  be  sent  to  another  file  or  device  by  using 
the  SET  command  to  switch  the  output  device.  For  example,  SET  OUTPUT 
*PRINT*,  would  route  the  output  to  the  printer. 

Error  messages  are  output  on  a different  unit  number  and  always 
go  to  *MSINK*  (the  terminal) . If  an  error  situation  arises  causing 
the  message  "Command  Stack  Reset"  to  appear,  the  output  device  will  be 
switched  back  to  *MSINK*,  if  it  was  diverted  elsewhere.  Also,  a 
keyboard  attention  will  switch  the  output  back  to  *MSINK*. 

2.1.3  Simulator  Control 

To  keep  the  simulator  from  running  away  from  the  user  a keyboard 
attention  interrupt  can  be  signalled  which  has  the  following  effects: 

(1)  Resets  the  command  input  stack  to  read  from  *MSOURCE*  (the 
terminal),  losing  any  outstanding  command  files. 

(2)  Resets  the  output  device  back  to  *MSINK*  (the  terminal) 

(3)  Performs  the  following  command  dependent  actions: 

3.1)  For  a DISPLAY  command,  an  attention  will  terminate 
the  output.  This  is  useful  if  a long  display  region  was 
accidently  displayed. 

3.2)  For  a HELP  or  STAT  command,  an  attention  will  term- 
inate the  output. 

3.3)  For  a RUN  command,  an  attention  will  stop  the  simula- 
tion and  print  the  parcel  address  of  the  next  instruction 
to  be  executed.  Simulation  may  be  resumed  without  any  loss 
of  timing  information  by  just  entering  a "RUN"  command.  No 
parcel  address  should  be  supplied  on  the  RUN  command,  as 
this  always  forces  a buffer  fetch  which  will  make  the  timing 
inaccurate. 
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4)  If  for  any  reason  the  simulator  seems  to  be  looping  and 
not  responding  to  attentions,  two  attentions  will  return 
control  to  MTS. 

Attention  trapping  is  only  enabled  while  control  is  inside  the 
simulator  or  the  command  language.  That  is,  if  the  simulator  is 
called  as  a subroutine,  attention  trapping  is  enabled  only  while  a 
call  to  the  CRAYl  interface  subroutine  is  active. 

2.1.4  Running  Programs  on  the  Cray-1  Simulator 

Before  a program  may  be  run  on  the  simulator,  it  must  first  be 
translated  to  a format  acceptable  for  loading  into  the  simulator. 

This  translation  is  typically  done  via  a Cray-1  cross  assembler. 

This  assembler  generates  absolute  or  relocatable  load  modules  that 
can  be  loaded  by  the  simulator  LOAD  command.  The  format  of  the 
load  module  is  described  in  appendix  K. 

When  designing  a Cray-1  program  to  be  simulated,  consideration 
must  be  given  first  to  the  nature  of  the  algorithm  under  study.  If 
the  program  requires  some  initialization  which  won't  be  written  in 
Cray-1  assembly  language,  then  perhaps  the  simulator  should  be  called 
as  a subroutine.  It  is  possible  for  the  calling  program  and  the 
simulator  to  both  share  the  Fortran  common  block  that  is  used  for  the 
simulated  Cray-1  memory.  In  fact,  if  the  users  calling  progreim  is 
loaded  first  by  MTS,  the  user  may  increase  the  size  of  the  simulated 
Cray-1  memory  beyond  the  4096  IBM  double  words  that  are  presently 
allocated.  For  example,  if  the  calling  program  was  in  the  MTS  file 
MAIN.O,  the  MTS  command  to  start  up  the  progreim  would  be 

$RUN  MAIN.O+CRAYl 

The  file  CRAYl  contains  the  object  module  of  the  simulator.  See 
section  2.3  for  a discussion  about  calling  the  simulator  as  a sub- 
routine. In  this  example,  execution  would  begin  in  the  user's  main 
program  which  would  initialize  the  data  area  in  the  Cray-1  memory  as 
required  by  the  algorithm  to  be  simulated.  The  main  program  would 
then  call  the  subroutine  CRAYl,  passing  a simulator  command  string 
which  could  load  the  Cray-1  program  and  begin  simulacion. 
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If  the  algorithm  under  study  requires  the  use  of  intrinsic 
functions,  such  as  SQRT,  SIN,  COS,  etc,  which  would  be  supplied  by 
some  Cray-1  subroutine  library,  the  user  may  provide  these  functions 
through  the  use  of  Cray-1  Simulator  EXIT  instruction  dispatcher. 

■nie  Cray-1  EXIT  instruction  (assembler  mnemonic  EX  exp)  contains  a 
9 bit  expression  field.  If  this  field  is  non-zero  the  simulator 
will  call  a subroutine  called  CRAYEX,  passing  the  value  of  the 
expression  field  and  several  register  arguments  to  it.  The  user 
may  write  a CRAYEX  subroutine  to  process  these  EXIT  codes  and  per- 
form any  function  he  wishes  to  define.  For  example,  an  EXIT  code  of 
one  could  be  defined  to  perform  a square  root  operation.  This  EXIT 
feature  avoids  the  expense  of  simulating  Cray-1  code  for  such  in- 
trinsic functions  by  allowing  them  to  be  programmed  directly  on  the 
host  machine.  See  section  2.4  for  a complete  discussion  of  the 
EXIT  dispatcher. 

Several  other  differences  between  the  Cray-1  computer  and  the 
simulator  arise  due  to  the  nature  of  the  IBM  370  architecture  upon 
which  the  simulator  runs. 

To  speed  the  simulation  of  arithmetic,  all  the  arithmetic  is 
done  using  the  IBM  370  arithmetic  instructions.  The  alternative 
would  be  to  simulate  the  Cray-1  arithmetic,  further  raising  the 
simulation  cost.  As  a consequence  of  using  host  machine  (IBM  370) 
arithmetic,  integers  in  the  simulator  are  32  bits  wide  whereas  Cray-1 
integers  are  only  24  bits  wide.  Although  the  simulator  memory  word 
is  a 64  bit  370  double  word,  which  is  the  same  width  as  a Cray-1 
memory  word,  the  floating  point  data  format  is  different.  On  the 
Cray-1  the  sign  and  exponent  field  is  16  bits  wide  whereas  on  the  IBM 
370  it  is  only  8 bits  wide.  Further,  the  Cray-1  exponent  is  a base 
2 exponent  whereas  the  IBM  370  exponent  is  base  16.  Figure  2.1.1 
shows  the  different  formats. 
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Figure  2.1.1  - Cray-1  vs.  IBM  370  Floating  point 
data  formats 

The  simulation  of  the  instruction  computation  is  done  in  it's 
entirety  when  the  instruction  issues.  The  pipeline  data  flow  in  the 
Cray-1  is  not  simulated.  This  means  that  upon  hitting  a BREAK  or 
AT  point,  all  results  of  prior  instructions  are  available  for  inspection 
or  modification.  The  instruction  where  the  BREAK  or  AT  point  is  set 
has  not  yet  been  executed. 

There  are  three  methods  for  controlling  the  simulation  of  a 
Cray-1  program: 

(1)  BREAK  points 

(2)  AT  points 

(3)  An  instruction  issue  limit  parameter. 

BREAK  and  AT  points  may  be  set  at  a specified  parcel  address 
in  the  simulated  program.  Setting  BREAK  or  AT  points  do  not  change 
the  instruction  at  that  location,  rather,  BREAK  and  AT  points  are 
detected  by  monitoring  the  P address  register.  This  permits  BREAK 
and  AT  points  to  be  set  before  the  program  is  loaded  or  reloaded. 
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When  a BREAK  point  is  hit,  control  goes  to  the  terminal.  When 
an  AT  point  is  hit,  a predefined  command  file  is  processed  which  was 
created  when  the  AT  point  was  set.  Control  will  not  go  to  the  ter- 
minal when  an  AT  point  is  hit  if  no  command  causes  this  to  happen. 

An  instruction  issue  limit  may  be  provided  as  an  optional  para- 
meter on  the  simulator  RUN  command.  For  example,  the  following  RUN 
command  would  begin  execution  at  parcel  address  21A  and  cause  control 
to  return  to  the  command  language  after  2500  Cray-1  instructions  have 
been  issued  (unless  an  EXIT  instruction  or  error  condition  occurred) . 

RUN  21A  #2500 

The  issue  limit  parameter  is  a decimal  number  prefixed  by  a pound 
sign.  If  no  issue  limit  is  specified  the  remaining  amount  of  a pre- 
vious limit  is  used  (in  the  case  of  a BREAK  or  AT  or  attention) . If 
there  is  no  remaining  amount,  a default  value  of  1000  is  used.  To 
single  step  through  a program  use  the  command: 

RUN  #1 

Without  a parcel  address  specified,  simulation  proceeds  with  the 
instruction  pointed  at  by  the  P address  register. 

While  in  the  command  language^  the  user  may  display  or  change 
registers  and  memory  locations  by  using  the  DISPLAY  and  CHANGE  commands. 

The  HELP  command  may  be  used  if  you  are  unsure  of  command  syntax 
or  the  function  of  a command.  For  example 

HELP  RUN 

will  describe  the  simulator  RUN  command. 

See  section  4 for  command  descriptions  of  all  simulator  commands. 

The  cost  of  simulating  Cray-1  programs  is  an  important  factor. 

The  simulator  provides  three  levels  of  cost  control: 

Level  1 - Result  computation  only,  still  allows  debugging  but 

eliminates  the  cost  associated  with  timing  the  Cray-1 
instructions. 

Level  2 - Timing  enabled,  allows  the  timing  of  the  simulated  pro- 
gram at  a cost  of  about  10  times  the  level  one  cost. 

Level  3 - CPACT  (clock  period  activity  report)  enabled,  increases 
the  cost  to  about  30  times  the  level  one  cost. 

Section  five  treats  the  cost  issue  further. 
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2 . 2 Exceptional  Conditions 

While  simulating  a Cray-1  program,  the  simulator  may  en- 
counter any  of  several  exceptional  conditions  which  will  halt  the 
simulation.  The  six  possible  exceptional  conditions  are  listed 
below  followed  by  a discussion  of  each  one: 

1)  Error  exit 

2)  Program  range  error 

3)  Operand  range  error 

4)  Floating  point  interrupt 

5)  Invalid  instruction  executed 

6)  Attention  interrupt. 

An  occurance  of  any  exceptional  condition  will  reset  the  com- 
mand stack  and  switch  OUTPUT  back  to  the  terminal  if  it  was  diverted 
elsewhere.  The  name  of  the  routine  being  executed  will  be  displayed, 
if  possible. 

2.2.1  Error  Exit 

An  error  exit  is  caused  when  the  Cray-1  executes  a zero  op-code. 
The  simulator  signals  this  condition  by  printing  the  message: 

ERROR  EXIT  AT  - p-addr 

where  p-addr  is  the  parcel  address  of  the  error  exit  instruction. 

Since  memory  is  initialized  with  zeros  when  the  simulator  is  started 
up,  a bad  or  missing  branch  could  cause  an  error  exit. 

2.2.2  Program  range  error 

A program  range  error  is  caused  by  a branch  instruction  which 
attempts  to  jump  outside  the  limits  of  the  currently  defined  simulator 
memory.  If  used  stand-alone,  4096  words  of  simulator  memory  are 
available.  A program  range  error  is  signalled  by  the  message, 

PROGRAM  RANGE  ERROR. 

BRANCH  AT  bch-p-addr 
TARGET  ADDRESS  WAS  tar-p-addr 
MEMORY  SIZE  IS  msize 
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The  parcel  address  of  the  offending  branch  instruction  is 
given  by  the  bch-p-addr  field.  The  invalid  target  address  of  the 
offending  branch  instruction  is  given  by  the  tar-p-addr  field. 

The  memory  size  (msize)  is  printed  in  octal  for  comparison  with 
the  invalid  target  address  and  to  inform  the  user  of  the  current 
memory  size. 

If  the  user  has  tried  to  extend  the  size  of  the  simulated 
Cray-1  memory  by  loading  a longer  common  block,  he  must  inform 
the  simulator  of  this  by  setting  the  MEMStZ  word  in  the  MSIZE  common 
block  to  the  correct  size  of  the  Cray-1  memory  (see  section  2.3).  If 
the  user  forgets  to  do  this  a size  of  4096  is  assumed  which  may  cause 
the  program  range  error. 

2.2.3  Operand  range  error 

An  operand  range  error  is  caused  by  an  operand  load  or  store 
that  exceeds  the  limits  of  the  currently  defined  simulator  memory. 

If  used  stand-alone,  4096  words  of  simulator  memory  are  available. 

An  operand  range  error  is  signalled  by  the  message, 

OPERAND  RANGE  ERROR  AT  P = p-addr 
MEMORY  SIZE  IS  msize 

The  parcel  address  of  the  offending  memory  reference  instruction 
IS  given  by  the  p-addr  field.  The  memory  size  (msize)  is  printed 
in  octal  to  inform  the  user  of  the  current  memory  size.  The  comments 
above  (under  program  range  error) , about  user  extension  of  Cray-1 
memory,  apply  here  as  well. 

A vector  load  or  store  to  memory  can  cause  an  operand  range 
error  in  several  ways: 

1)  The  base  address  may  be  out  of  range 

2)  The  operand  increment  may  be  too  large 

3)  The  vector  length  may  be  too  large. 

2.2.4  Floating  point  interrupt 

The  floating  point  interrupt  exception  is  handled  differently 
by  the  simulator  than  it  is  on  the  Cray-1.  This  discussion  will  deal 
with  the  simulator  response  to  a floating  point  interrupt.  See  the 
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Cray-1  Reference  Manual  for  the  Cray-1  response. 

The  simulator  response  to  a floating  point  interrupt  is  a 
consequence  of  the  behavior  of  the  IBM  370  architecture.  Three 
types  of  floating  point  interrupts  may  occur: 

1)  Exponent  overflow 

2)  Exponent  underflow 

3)  Division  by  zero. 

All  three  types  of  floating  point  interrupt  may  be  suppressed 
if  the  floating  point  interrupt  bit  in  the  Cray-1  mode  register  is 
clear.  When  the  simulator  starts  up,  this  mode  register  bit  is  set, 
thereby  enabling  all  three  types  of  floating  point  interrupts.  The 
setting  of  this  mode  register  bit  may  be  controlled  by  the  user  in 
two  ways: 

1)  Through  the  SET  EFI  = [ON/OFF]  command,  the  user  may 
enable  or  disable  floating  point  interrupts. 

2)  Through  the  Cray-1  instructions  EFI  and  DFI,  the  program 
may  enable  or  disable  floating  point  interrupts. 

Only  one  floating  point  interrupt  is  detected  for  each  instruction 
simulated.  This  means  that  if  a vector  instruction  causes  20  exponent 
overflows,  only  one  will  be  detected.  After  the  instruction  has 
finished  executing  the  simulator  will  announce  the  floating  point 
exception  (if  the  EFI  mode  bit  is  set)  and  return  to  the  command 
language . 

When  an  exponent  overflow  occurs,  the  following  message  is 
printed : 

**  EXPONENT  OVERFLOW  ** 

FLOATING  POINT  ERROR  AT  P = p-addr 

When  an  exponent  underflow  occurs,  the  following  message  is 
printed : 

**  EXPONENT  UNDERFLOW  ** 

FLOATING  POINT  ERROR  AT  P - p-addr 

When  a division  by  zero  occurs,  the  following  message  is 
printed : 
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**  FLOATING  POINT  DIVIDE  CHECK  ** 

FLOATING  POINT  ERROR  AT  P = p-addr 

For  each  of  the  three  messages  the  parcel  address  (p-addr) 
of  the  instruction  causing  the  interrupt  is  printed. 

2.2.5  Invalid  instruction  executed 

The  monitor  mode  Cray-1  instructions  are  not  implemented  on 
the  simulator.  When  one  of  these  is  executed,  the  simulator  will 
print  the  message, 

**  ATTEMPT  TO  EXECUTE  INVALID  INSTRUCTION  AT  : p-addr 

will  be  printed  and  the  simulator  will  return  to  the  command  lang- 
uage. The  offending  instruction’s  parcel  address  (p-addr)  is  printed 
to  aid  in  finding  the  instruction. 

2.2.6  Attention  interrupt 

To  stop  the  simulation  or  regain  control  during  command  file 
processing,  the  MTS  terminal  user  may  issue  a )ceyboard  attention 
by  hitting  the  broa)c  )cey  or  a control-E.  This  attention  interrupt 
will  reset  the  command  stacJc  and  halt  simulation  if  in  progress.  If 
simulation  was  in  progress  the  message, 

**  SIMULATOR  ATTN  AT  P = p-addr  ** 

will  be  printed,  where  p-addr  is  the  instruction  to  execute  next 
if  simulation  is  continued.  An  attention  will  cause  no  information 
to  be  lost  and  simulation  may  be  resumed,  as  if  never  interrupted, 
by  entering  a RUN  command  without  a p-addr  parameter. 
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2.3  Subroutine  Interface 

j — ■ ■ ■■■  - - . .. 

The  Cray-1  simulator  may  be  called  as  a subroutine  from  a 
user  Fortran-IV  program.  Three  benefits  provided  by  this  inter- 
face are: 

1)  Being  able  to  convert  only  a portion  of  a Fortran  pro- 
gram to  Cray-1  assembly  language,  allows  you  to  simulate 
the  converted  portion  while  leaving  the  remaining  in  Fortran 
to  run  more  efficiently  on  the  host  machine. 

2)  Being  able  to  enlarge  the  amount  of  simulated  Cray-1 
memory  by  extending  the  memory  common  block  in  the 
user's  calling  program  and  loading  this  program  first. 

This  avoids  the  need  for  recompiling  the  simulator. 

3)  When  studying  a given  algorithm,  for  application  to  the 
Cray-1,  it  is  convenient  to  perform  any  housekeeping  and 
initialization  functions  in  the  user's  Fortran  program. 
Therefore,  only  the  algorithm  need  be  coded  in  Cray-1 
assembly  language. 

This  section  will  discuss  the  protocol  used  to  communicate  with 
the  simulator  from  a calling  program.  This  communication  has  two 
aspects  to  it:  (1)  the  subroutine  interface  used  to  pass  commands 
and  control  to  the  simulator  and  (2)  the  shared  Cray-1  memory  inter- 
face used  to  pass  data  to  and  from  the  simulator. 

2.3.1  Simulator  subroutine  call 

To  access  the  simulator  as  a subroutine  the  following  Fortran 
subroutine  call  is  used: 

CALL  CRAYl ( ' cmd [ ;cmd  ] ...  I ' ,echosw) 

The  first  argument  is  a literal  string  enclosed  by  apostrophes 
which  may  be  composed  of  one  or  more  simulator  commands.  Each  com- 
mand follows  the  same  syntax  as  the  commands  described  in  section  4. 
To  specify  multiple  commands  with  a single  call  to 
the  simulator,  separate  the  commands  with  a semicolon.  The  entire 
command  string  must  be  terminated  with  an  exclamation  point  and  may 
not  exceed  200  characters. 
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The  second  parameter  (echosw)  is  a logical  constant  or  variable. 
This  parameter  controls  the  echoing  of  the  commands  passed  in  the 
first  argument.  If  echosw  is  .TRUE.,  the  commands  will  be  echoed 
to  the  current  simulator  output  device  as  they  are  processed.  If 
echosw  is  .FALSE.,  command  echoing  is  suppressed. 

If  the  user  wants  to  give  control  to  the  terminal  at  some  point 
in  the  conTvind  string  se^fuence,  the  command  USE  *MSOURCE*  will  allow 
additional  commands  to  be  read  from  the  terminal.  For  example,  the 
call 

CALL  CRAYK’LOAD  TRIDECjUSE  *MSOURCE*  ; RUN  21A  # 2 000 ! ' , . TRUE . ) 

will  cause  the  file  TRIDEC  to  be  loaded  into  the  simulator  memory 
after  which, the  USE  command  will  cause  control  to  go  to  the  user's 
terminal,  allowing  breakpoints  to  be  set,  etc.  An  end-of-file  con- 
dition at  the  user's  terminal  (via  ENDFILE,  control-c,  etc.)  will 
terminate  the  USE  command  permitting  the  "RUN  21A  #2000"  command  to 
be  executed.  VJhen  the  last  command  in  the  command  string  is  executed 
an  automatic  return  is  made  to  the  caller  of  the  simulator.  By  set- 
ting the  echosw  parameter  to  .TRUE. , the  three  passed  commands  will 
be  echoed  to  the  simulator  output  device  as  they  are  processed. 

In  order  to  call  the  simulator  as  a subroutine,  the  user's  pro- 
gram must  first  get  control.  To  accomplish  this,  two  things  must  be 
done : 

1)  The  user  program  must  be  set  up  as  a main  program. 

2)  The  user  program  must  be  loaded  before  the  simulator  is 
loaded . 

This  is  a consequence  of  the  following  two  facts: 

1)  When  MTS  starts  up  a Fortran  program  (via  the  MTS  $RUN 
command) , control  is  given  to  the  main  program. 

2)  When  the  MTS  loader  encounters  more  than  one  main  program 
it  ignores  all  but  the  first  one. 

The  simulator  has  a small  internal  main  program  which  gets  control  if 
the  simulator  is  run  stand-alone.  But,  if  the  user  writes  a main 
program  and  loads  it  before  the  simulator  is  loaded,  the  simulator's 
main  program  is  ignored  by  the  loader.  Therefore,  when  loading  is 
finished  MTS  will  give  control  to  the  user's  main  program. 
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As  an  example,  suppose  the  user  wrote  the  following  Fortran 
main  program  and  compiled  it  into  the  MTS  file  MAIN.O. 

CALL  CRAYICUSE  ‘SOURCE*  FALSE . ) 

STOP 

END 

To  use  this  main  program  and  have  it  get  control  first,  use  the 
following  MTS  run  command: 

$RUN  MAIN.O+CRAYl 

Although  most  user  main  programs  would  be  more  complicated  than 
this  one,  this  main  program  is  in  fact  the  small  internal  main 
program  used  by  the  simulator. 

2,3.2  Simulator  memory  sharing 

The  simulated  Cray-1  memory  can  be  shared  both  by  the  simulator 
and  the  user's  calling  program.  This  is  accomplished  by  having  the 
user  include  in  his  program  the  Fortran  common  block  declaration  used 
by  the  simulator  to  allocate  the  Cray-1  memory  space.  This  commom 
block  declaration  appears  in  the  simulator  as  follows: 

DOUBLE  PRECISION  MEM 
COMMON  /MEMORY/  MEM  (4096) 

COMMON  /MSIZE/  MEMSIZ 
INTEGER  IMEM(2,1) 

EQUIVALENCE  (MEM ( 1 ) , IMEM (1,1)) 

The  MSIZE  common  block  contains  the  single  word  MEMSIZ  whose 
value  is  the  current  size  of  Cray-1  memory.  MEMSIZ  is  used  to  per- 
form bounds  checking  on  branches  and  memory  references  made  by  the 
simulated  Cray-1  instructions.  When  the  simulator  is  called  for 
the  first  time  some  once-only  initialization  is  done  which  includes 
zeroing  all  of  Cray-1  memory  (MEM) . Therefore,  MEMSIZ  must  be  init- 
ialized properly  before  the  first  call  to  CRAYl.  Further  since  the 
once-only  initialization  will  zero  Cray-1  memory,  the  very  first 
call  to  CRAYl  must  be  made  before  the  user's  calling  program  initializes 
any  of  MEM.  It  is  suggested  that  this  first  initialization  call  be 
made  as  follows: 
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CALL  CRAYl { ' INITl ' , .FALSE. ) 


The  MEMORY  conunon  block  contains  the  array  MEM,  which  is  used 
as  the  Cray-1  memory  by  the  simulator.  This  is  declared  in  the 
simulator  to  be  4096  double  words  long.  The  user  may  extend  this 
common  block  to  enlarge  the  Cray-1  memory.  This  is  done  by  writing 
a Fortran  main  program  which  includes  the  common  declaration  state- 
ments shown  above,  but  with  the  4096  constant  replaced  with  a larger 
value  as  needed.  Then  by  loading  the  user  main  program  first  (see 
section  2.3.1),  the  user's  main  program  not  only  replaces  the  simu- 
lator's main  program,  but  the  user's  enlarged  version  of  the  MEMORY 
common  block  replaces  the  simulator's  version. 

To  pass  data  to  and  from  the  simulator  Cray-1  memory,  the  user 
need  only  read  and  write  data  to  the  MEM  array.  However,  because  the 
Cray-1  memory  address  starts  at  location  zero  and  Fortran  arrays  are 
indexed  beginning  at  one,  the  user  must  formulate  the  index  into  MEM 
by  using  the  Cray-1  memory  address  and  adding  one  to  it.  For  example, 
Cray-1  memory  location  3 is  MEM(4). 

The  following  example  is  the  skeletal  structure  of  a user  main 
program  which  extends  Cray-1  memory  to  8192  words. 

DOUBLE  PRECISION  MEM 
COMMON  /MEMORY/  MEM (8192) 

COMMON  /MSIZE/  MEMSIZ 
INTEGER  IMEM(2,1) 

EQUIVALENCE  (MEM  (1)  , IMEM  (1,1)  ) 

C 

C ....  SET  UP  MEMSIZ  WITH  THE  NEW  MEMORY  SIZE. 

MEMSIZ  = 8192 
C 

C ....  DO  SIMUIATOR  ONCE-ONLY  INITIALIZATION 
CALL  CRAYl (' INIT! ', .FALSE. ) 

I User  initialization  of  Cray-1  memory 

CALL  CRAYl ('LOAD  UPROG;RUN  20A  #1 000 !',. TRUE . ) 

I User  prints  out  results  of  simulated  computation 

STOP 

END 

See  section  3.2  for  a more  complete  example  of  accessing  the  simulator 
as  a subroutine. 
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2 . 4 CRAYEX  Exit  Dispatcher 

As  discussed  in  section  2.3,  it  is  often  useful  to  allow  the 
Cray-1  simulation  to  be  embedded  as  a portion  of  a larger  Fortran 
program.  Conversely,  it  is  also  useful  to  be  able  tx3  call  a Fortran 
program  from  within  the  simulated  Cray-1  program.  This  transfer 
of  control  from  the  Cray-1  program  to  a Fortran  program  is  acconpl ished 
through  the  use  of  the  Cray-1  exit  instruction. 

The  Cray-1  assembly  language  mnemonic  for  the  exit  instruc- 
tion is  shov/n  below: 


EX  i jk 

The  exit  code  field  (ijk)  is  a nine  bit  field  within  the  exit  in- 
struction. Exit  codes  may  range  from  zero  to  511  decimal,  when 
the  simulator  encounters  an  exit  instruction,  it  checks  the  exit 
code  field  (ijk)  for  a non-zero  value.  If  ijk  is  zero,  a normal 
Cray-1  program  exit  is  performed.  If  ijk  is  non-zero,  the  simula- 
tor will  call  the  subroutine  CRAYEX.  If  the  user  supplies  a CRAYEX 
subroutine  and  loads  it  first  (see  section  2.3.1),  the  user's 
CRAYEX  routine  will  get  control.  If  no  user  CRAYEX  routine  is 
provided,  the  simulator  will  perform  a normal  Cray-1  program  exit. 

If  the  user  provides  a CRAYEX  routine  the  simulator  will  call  it  with 
the  following  Fortran  subroutine  call  statement: 

CALL  CRAYEX (IJK,  AREG , SREG,  VREG , VL,  EXSW) 

The  arguments  passed  by  the  subroutine  call  are  discussed  below: 

IJK  - This  input  parameter  is  an  integer  which  contains  the  value 
of  the  ijk  field  in  the  exit  instruction.  It  may  be  used 
as  a dispatch  parameter , allowing  different  exit  codes  to  per- 
form different  functions. 

AREG  - This  parameter  is  an  eight  element  integer  array  used  to 

pass  the  Cray-1  A-register  contents  to  CRAYEX.  This  allows 
arguments  to  be  provided  and  results  returned  through  the 
A-registers.  Cray-1  register  A0  corresponds  to  AREG(l). 
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SRF.r,  - This  parameter  is  an  eight  element  double  precision  array 
used  to  pass  the  Cray-1  S-register  contents  to  CRAYEX. 

This  allows  arguments  to  be  provided  and  results  returned 
through  the  S-registors.  Cray-1  register  SjZ)  corresponds 
to  SREG(l). 

VREG  - This  parameter  is  a double  precision  array,  dimensioned 

as  (64,8),  used  to  pass  the  Cray-1  Vector  register  contents 
to  CRAYEX.  Arguments  may  be  provided  and  results  returned 
through  the  vector  registers.  Cray-1  vector  register  V0 
corresponds  to  VREG(-,1). 

VL  - This  parameter  is  an  integer  which  contains  the  value  of 

the  Cray-1  vector  length  register.  On  entry  to  CRAYEX,  VI, 
will  always  be  between  one  and  64.  VL  may  be  changed  by 
CRAYEX  and  this  change  will  be  reflected  in  the  Cray-1 
vector  length  register.  On  return  from  CRAYEX  to  the  simu- 
lator VI,  must  be  in  the  range  of  1 to  64. 

EXSW  - This  return  parameter  is  of  type  logical.  On  return  to  the 
simulator  from  CRAYEX,  if  EXSW  is  .TRUE.,  the  simulator  will 
perform  a normal  Cray-1  program  exit.  If  EXSW  is  .FALSE., 
the  simulator  will  treat  the  exit  instruction  as  a no-op 
and  continue  the  simulation  with  the  next  instruction.  On 
entry  to  CRAYEX,  EXSW  has  the  value  .FALSE.. 

In  addition  to  the  CRAYEX  calling  parameters,  the  CRAYEX  pro- 
gram may  access  Cray-1  memory  by  sharing  the  memory  common  block 
as  described  in  section  2.3.2.  This  permits  the  CRAYEX  routine 
to  perform  major  computation,  I/O  etc.,  directly  to  the  Cray-1 
memory. 
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The  following  example  is  a skeleton  CRAYEX  dispatcher. 


C 

C . . . 

C 

C . . . 


C 

C . . . 

100 


C 

c . . . 

200 


SUBROUTINE  CRAYEX  (UK,  AREG,  SREG,  VREG,  VL,  EXSW) 
LOGICAL  EXSW 
INTEGER  AREG (8),  VL 

DOUBLE  PRECISION  SREG(8),  VREG(64,8) 

DISPATCH  ON  THE  EXIT  CODE. 

GO  TO  (100,  200,  300,  ...),  UK 

EXIT  CODE  UNDEFINED  - TREAT  AS  NORMAL  EXIT 
EXSW  = .TRUE. 

RETURN 

EXIT  CODE  = 1. 


; do  exit  code  1 processing. 
RETURN 

EXIT  CODE  = 2 

I do  exit  code  2 processing. 
RETURN 


END 


The  user  may  define  exit  code  one  to  be  a SQRT  function,  exit 
code  2 to  be  COS  function,  etc.  Arguments  and  results  may  be  passed 
through  the  registers  or  memory  providing  considerable  flexibility 
in  the  algorithm  design  and  implementation. 
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2 . 5 Report  Generation 

The  Cray-1  simulator  produces  two  kinds  of  report  outputs, 

STAT  and  CPACT.  The  STAT  report  is  a summary  report  of  the  pro- 
gram's use  of  Cray-1  resources.  The  CPACT  report  is  a detailed 
report  of  Cray-1  resource  usage  at  each  clock  period  of  the  pro- 
gram's execution. 

2.5.1  STAT  report 

The  STAT  report  consists  of  three  sections: 

1)  Vector  Usage  Counts 

2)  Floating  Point  Result  Counts 

3)  Data  Traffic  Counts 

Timing  must  be  enabled  only  for  the  Vector  Usage  Counts  section. 

The  Vector  Usage  Counts  section  reports  the  program's  use  of 
the  Cray-1  vector  unit  resources.  Figure  2.5.1,  below,  shows  the 
Vector  Usage  Counts  table. 

U OF  M CPAT-1  SIKULATOE  (UN138)  THU  JOH  15/78 
VECTOR  OSAGE  COONTS 
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5.02 

Figure 

2.5.1 

- Vector 

Usage  Counts  Table 

Each  column  of  the  Table  represents  a different  vector  functional 
unit.  Left  to  right  the  units  are:  floating  point  add,  floating 
. point  multiply,  floating  point  reciprocal  approximation,  logical, 

shift,  integer  add  and  memory,  split  between  vector  loads  and  vector 
stores.  The  rows  of  the  table  represent:  unit  busy  time,  percent 
unit  busy  of  total  run  time,  the  number  of  results  produced  by  the 

?• 

i 
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unit,  the  number  of  vector  instructions  issued  to  the  unit  and 
the  average  vector  length  processed  by  the  unit. 

Five  other  statistics  are  printed  beneath  the  table:  the 
run  time  since  the  last  INIT  command  or  simulator  start  up,  the 
MFLOPS  (million  floating  point  operations  per  second)  for 
the  program,  the  composite  average  vector  length  over  all  vector 
units,  the  vector  unit  concurrency,  and  the  MIPS  rate. 

MFLOPS  is  calculated  over  all  floating  point  operations, 
both  vector  and  scalar.  It  is  computed  as  the  number  of  floating 
point  operations  divided  by  the  program  run  time  in  seconds. 

Concurrency  is  calculated  as  the  sum  of  all  vector  unit  busy 
times  divided  by  the  program  run  time.  It  is  a global  measure  of 
the  concurrent  use  of  the  Cray-1  vector  units. 

MIPS,  millions  of  instructions  per  second,  is  calculated  as, 
the  number  of  instructions  issued  divided  by  the  program  run  time 
in  seconds. 

The  Floating  Point  Result  Counts  section  reports  the  program's 
use  of  both  vector  and  scalar  floating  point  operations.  For  each 
entry  in  the  table  (Figure  2.5.2)  both  the  number  of  results  and 
it's  percentage  are  printed. 


FLOATING  POINT 

PESOLT 

COONTS 

ADDITION 

HULTIPLICATIOH 

RECIPROCAL 

TOTAL 

VECTOR 

(*) 

5530 

( 

43.4) 

7119 

( 

55.9) 

63 

( 

•'‘.5) 

12712 

( 99.9) 

SCALAR 

{%) 

5 

( 

0.0) 

5 

( 

0.0) 

8 

( 

0.1) 

18 

( 0.1) 

TOTAL 

(*) 

5535 

( 

43.5) 

7124 

( 

56.0) 

71 

( 

0.6) 

12730 

(100. 0) 

Figure  2.5.2  - Floating  Point  Result  Counts  Table 

Floating  point  additions  (and  subtractions)  and  reciprocals  are 
counted  directly  from  the  instructions  that  perform  them,  but  the 
multiplication  count  requires  some  adjustment  due  to  the  reciprocal 
approximation . 

Because  a reciprocation  on  the  Cray-1  is  an  approximation, 
two  additional  multiplications  must  be  done  to  get  a full  precision 
result.  One  of  these  multiplications  is  a reciprocal  iteration  and 
the  other  is  a standard  multiplication.  The  Cray-l  instruction  se- 
quence below  illustrates  the  scalar  instructions  used  to  obtain  a 
full  precision  scalar  reciprocal  (SI  = 1/S2) . 
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SI 

/HS2 

reciprocal 

approximation 

S2 

S2*IS1 

reciprocal 

iteration 

SI 

S2*S1 

extend  precision 

To  count  these  additional  multiplies  as  part  of  the  floating  point 
operation  count  would  overstate  this  count,  since  they  really  are 
part  of  a single  reciprocal  operation.  Consequently  these  two 
multiplies  have  been  deducted  from  the  multiply  count  in  the  table. 

This  adjustment  is  made  by  subtracting  the  number  of  detected 
reciprocal  iterations  from  the  number  of  standard  multiplications. 

A reciprocal  iteration  is  detected  through  the  issue  of  a 067,  166 
or  167  instruction.  The  sum  of  all  vector  and  scalar  floating 
point  operations,  shown  in  the  lower  right  corner  of  the  Figure 
2.5.2,  is  used  as  the  numerator  of  the  MFLOPS  calculation  discussed 
above . 

The  Data  Traffic  Counts  section  is  the  last  section  of  the 
STAT  report.  Tt  is  only  printed  if  the  FULL  option  is  specified  on 
the  STAT  command.  Figure  2.5.3,  on  the  following  page,  is  an  example 
of  the  Data  Traffic  Counts  section. 

This  section  reports  the  amount  of  data  traffic  on  the  major 
data  trunks  of  the  Cray-1.  To  aid  in  identifying  the  various  data 
paths  for  which  traffic  information  is  provided,  the  simulator  prints 
a block  diagram  of  the  Cray-1  central  processor  and  attaclies  path 
labels  to  each  of  the  data  paths.  These  path  labels  are  referenced 
on  the  left  hand  side  of  the  report  proceeding  a number,  representing 
the  number  of  operands  shipped  over  that  path.  The  data  paths  with 
arrows  are  uni-directional  whereas  the  paths  shown  dotted  are  bi- 
directional . 

The  left  most  column  of  the  figure  represents  the  Cray-1  computa- 
tional units  divided  into  three  groups;  vector,  scalar  and  address. 

The  floating  point  functional  units  are  assumed  to  be  shared  between 
the  vector  and  scalar  groups. 

The  center  column  of  the  figure  represents  the  Cray-1  register 
storage.  Top  to  bottom  these  four  register  groups  are  the  vector 
registers,  the  scalar  registers,  the  T and  B registers  and  the  address 
registers.  The  vertical  bi-directional  communication  paths  (shown  dotted) 
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Figure  2.5.3 


between  the  four  reqister  groups  are  used  for  inter-group  data 
transfers . 

The  right  hand  column  of  the  figure  represents  Cray-1  main 
memory.  MEMORY  is  shown  in  four  sections  only  for  the  purpose  of 
the  figure.  Any  register  group  may  reference  any  location  in  Cray-1 
main  memory. 

The  labeling  scheme  is  defined  as  follows: 


1) 


"A"  moans  address,  "S"  means  scalar  and 
vector . 


■v 


means 


For  example, 


2)  "O"  means  operands  and  "R"  means  result. 

3)  "X"  means  a bi-directional  data  path 

4)  "M"  means  the  path  is  a memory  path  used  by  the  three 
register  groups  tied  both  to  memory  and  a computational 
unit.  The  T and  B registers  communicate  only  with 
memory  and  other  register  groups. 

’SMO"  is  the  operand  data  path  to  the  scalar  registers 
from  memory,  where  "SO"  is  the  operand  data  path  to  the  scalar  com- 
putation units  from  the  scalar  registers. 

Below  the  data  path  portion  of  the  report,  four  other  statistics 
are  printed: 

1)  MTSC.  represents  the  number  of  miscellaneous  instructions 
executed  by  the  program  that  do  not  move  data  across  any 
of  the  paths  shown  in  the  figure  and  are  not  branch 
instructions.  The  instructions  counted  include:  op-codes 
2-4,  20-22,  40-43,  72-73. 

2)  BRANCHES  represent  the  number  of  branch  instructions 
executed  by  the  program  whether  the  branch  is  taken  or  not. 

3)  FETCHES  represent  the  number  of  parcel  buffer  fetches 
incurred  by  the  running  program. 

4)  ISSUES  represent  the  number  of  instructions  issued  by 
the  running  program. 

The  last  part  of  the  Data  Traffic  Counts  section  shows  nine 
percentage  and  ratio  calculations.  Each  of  these  are  discussed  below 
with  their  derivation. 
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1. 


Percent  of  vector  operands  supplied  by  cache. 

The  term  cache  refers  to  the  eight  Cray-1  vector  reg- 
isters. This  percentage  reflects  the  dominance  of  the 
cache  over  memory  in  supplying  vector  operands  to  the 
vector  units.  It  is  defined  as, 


VO-VMO 

VO 


* 100 


2.  Percent  of  total  vector  traffic  supplied  by  cache. 

This  percentage  is  similar  to  (1)  above,  but  also  includes 
the  effect  of  the  vector  results  data  traffic.  It  is 
defined  as. 


VO-VMO  -f-  VR-VMR  * 

VO  + VR 

3.  Percent  vector  results  of  total  results. 

This  percentage  is  a measure  of  the  vector-scalar  composi- 
tion of  the  program's  computation.  This  figure  reflects 
the  percentage  of  all  results  computed  in  vector  mode. 
Because  scalar  and  vector  instructions  can  execute  con- 
currently, this  figure  is  not  the  percentage  of  time 
spent  in  vector  mode.  This  figure  is  defined  as, 

YB * 100 

VR  + SR  + AR 

4.  Percent  vector  memory  traffic  of  total  memory  traffic. 

This  percentage  is  a measure  of  the  vector-scalar  composi- 
tion of  the  program's  memory  usage.  This  figure  reflects 
the  percentage  of  vector  traffic  to  and  from  the  main 
memory.  It  is  defined  as, 

VMO  -t-  VMR  * 

TMDT  + FETCH 
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FETCH  is  the  number  of  memory  words  read  into  the  instruc- 
tion parcel  buffers.  TMDT  is  the  total  memory  data  traffic 
and  is  defined  as, 

TMDT  = VMO+VMR  + SMO+SMR  + AMO+AMR  + HTO+BTR 

5.  F^atio  of  computation  traffic  to  memory  traffic. 

This  ratio  is  a measure  of  the  benefit  provided  by  the 
register  portion  of  the  Cray-1  memory  hierarchy  in  re- 
ducing the  main  memory  data  traffic.  If  this  ratio  was 
one  there  would  bo  no  benefit  in  having  the  registers, 
since  register  traffic  equals  memory  traffic.  Typically 
this  ratio  is  in  the  range  of  two  to  five  indicating 
that  the  registers  provide  a substantial  reduction  in 
main  memory  data  traffic.  This  ratio  is  defined  as, 

VO+VR  + SO+SR  + AO+AR 
TMDT 


6.  Ratio  of  vector  memory  operands  to  vector  memory  results. 

This  ratio  is  a measure  of  the  average  vector  operand  re- 
quirements of  the  program.  This  ratio  combined  with  t.he 
vector  memory  result  rate  (see  10  below)  and  the  algorithmic 
cxjmr>lexity  of  main  memory  usage  (the  computational  lifetime  of 
data  in  main  memory)  will  allow  the  algorithm  designer 
to  determine  the  mass  storage  I/O  data  rates  necessary 
to  keep  the  vector  arithmetic  units  constantly  busy.  This 
ratio  is  defined  as, 


vm 

VMR 

7.  Ratio  of  vector  unit  results  to  vector  memory  operands. 

This  ratio  is  a figure  of  merit  of  the  average  value  of 
main  memory  operands  in  the  computation.  A value  of  two 
would  imply  that  each  main  memory  operand  precipitates 
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two  arithmetic  operations.  This  ratio  is  defined  as, 


VR 

VMO 

8.  Ratio  of  vector  unit  results  to  vector  memory  results. 

This  ratio  is  similar  to  (5)  above,  but  compares  only 
vector  result  traffic.  This  reflects  the  average  number 
of  arithmetic  operations  necessary  for  the  production 
of  one  stored  result.  The  ratio  is  defined  as, 

VMR 

9.  Ratio  of  vector  unit  cache  use  to  memory  unit  cache  use. 

Like  (5)  above,  this  ratio  measures  the  benefit  of  main 
memory  bandwidth  reduction  provided  by  the  registers  in 
the  memory  hierarchy.  However,  this  ratio  only  compares 
the  data  traffic  into  and  out  of  the  cache.  it  is  defined 
as , 

VO  VR 
VMO  f VMR 

10.  Vector  memory  result  rate. 

This  is  the  rate  at  which  the  program  sends  vector  results 
to  main  memory.  This  is  useful  for  assessing  mass  storage 
I/O  data  rates  as  mentioned  in  (6)  above. 

VMR 

RTC 

The  ratios  and  percentages  above  tended  to  emphasize  the  vector 
portion  of  the  Cray-1.  Clearly,  similar  figures  could  be  calculated 
for  the  scalar  and  address  portions  of  the  machine.  The  raw  data  is 
provided  as  part  of  the  Data  Traffic  Counts  report  so  that  the  user 
may  calculate  scalar  figures  or  devise  other  measures  of  algorithm- 

processor  performance. 
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2.5.2 

The  CPACT  report  produces  a detailed  clock  period  activity 
record  of  the  Cray-1  machine  state.  This  132  column  report 

suitable  only  for  firintinq  on  a line  printer.  Fiqure  2.5.4  on 
the  followinq  paqe  shows  the  format  of  the  report.  Across  the 
to[j  of  the  report,  the  various  column  headinqs  are  devoted  to  the 
Cray-1  resources  that  may  be  called  into  use  by  a Cray-1  instruc- 
tion. Time  flows  down  the  paqe  with  each  clock  period  of  simulation 
time  producinq  an  output  record  that  describes  the  state  of  Cray-1 
resources  at  that  clock  period.  With  vector  instructions  usinq 
lonq  vector  lenqths,  the  machine  resource  state  may  remain  unchanqed 
for  fifty  or  more  clock  periods,  resultinq  in  many  identical  CF’ACT 
output  records.  The  COMPRESS  option  on  the  CPACT  command  (see 
section  4)  may  be  used  to  suppress  the  printinq  of  ten  or  more 
identical  output  records.  This  substantially  reduces  simulation 
cost  and  makes  the  CPACT  report  far  more  manaqeable.  One  line  of 
compression  dots  are  printed  in  place  of  the  suppressed  records. 

The  CPACT  report  is  partitioned  into  the  followinq  21  Cray-1 
resource  fields: 

1.  ST.  - The  machine  state  field. 

This  field  indicates  the  machine  state  at  each  clock 
period.  Three  possible  entries  are:  (1)  "IS",  which 
means  that  an  instruction  is  issuinq  at  this  clock 
period,  (2)  blank  which  means  that  no  instruction  will 
issue  at  this  clock  period,  and,  (3)  "FE",  which  means 
a parcel  buffer  fetch  sequence  is  initiated  at  this 
clock  period. 


2.  TAG  - The  activity  resource  taq . 

At  a clock  period  in  which  a new  machine  activity 
(instruction  issue  or  fetch  request)  is  initiated,  the 
activity  is  assiqned  a one  letter  activity  resource 
taq  (A-Z,  )?-9)  which  is  used  in  subsequent  clock  periods 
to  identify  the  Cray-1  resources  called  into  use  by  the 
initiated  activity.  When  a conflict  occurs  in  the  demand 


*A  narrow  version 
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Figure  2.5.4  - CPACT  Report  Format 


for  a Cray- 1 rosrnjrcG,  the  taq  occupyinfj  the  resource 
may  be  tracer]  back  to  the  initiatinrj  activity. 

Resource  conflict  occurs  when  an  activity  initiated 
in  a fjast  clock  per iod , occupies  a Cray-1  resource 
that  is  now  beinrj  demanded  by  another  activity.  For 
example,  when  an  arithmetic  instruction  issues,  the 
result  rerjister  is  reserved  until  the  result  arrives 
at  the  rerjister.  Because  the  Cray-1  is  pipe-lined, 
a subserjuent  instruction,  that  rerjuires  the  previous 
arithmetic  result  as  an  input  operand,  may  experience 
an  operanrl  rerjister  conflict,  causincj  it  to  hold  issue 
until  the  previous  arithmetic  result  arrives  at  the 
operand  rerjister  (the  previous  instruction's  result 
rerjister).  In  this  example  the  resource  conflict  occurs 
on  a rerjister.  The  CPACT  report  will  show  the  first  in- 
structirjn's  activity  tarj  in  the  report  column  crjrrespond inrj 
to  the  result  reqister  of  the  instruction.  The  tarj  will  re- 
main in  this  column  until  the  result  reqister  reservation  ex- 
fjires  (i.e.,  the  rlata  has  arriveri)  . If  the  secrjnd  instruction 
rif^rnanrls  thr;  use  rjf  this  result  rerjister  before  the  reser- 
vation has  oxpirerJ,  the  result  rerjister  reservation  taq 
will  be  underscorerl  and  the  second  instruction  will  hold 
issurj  until  the  rlata  arrives. 

The  underscorinq  of  activity  tags  is  used  throughout  the 
report  to  highlight  the  resource  conflicts  of  waiting 
instructions . 

3.  IN.STRUCTTON  - The  Mnemonic  for  the  issuing  instruction. 

When  a Cray-1  instruction  issues  ("IS"  in  machine  state 
field),  the  assembly  mnemonic  for  the  instruction  is 
printed  in  this  column. 

4.  P-ADDR  - The  jjarcel  address  of  the  issuing  instruction. 

When  a Cray-1  instruction  issues,  the  parcel  address 
from  where  it  came  is  printed  in  this  column. 
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5.  CP  - The  simulator  clock  period. 

This  column  contains  the  simulator  clock  period.  The 
clock  period  is  reset  to  zero  by  an  INIT  command.  If 
the  user  turns  timing  on  and  off  through  the  SET  command 
or  the  ERT,  DRT  instructions,  the  clock  period  is  not 
affected  but  the  machine  resource  state  is  cleared. 

6.  +*/^>+  - The  Cray-1  vector  functional  units. 

Each  of  those  six  columns  represent  the  reservation 
state  of  a Cray-1  Vector  functional  unit.  Left  to 
right  the  units  are  : floating  point  adder,  floating 
point  multiplier,  floating  point  reciprocal  approxi- 
mation, vector  logical,  vector  shift,  vector  integer 
adder.  The  activity  tag  of  a vector  instruction  which 
reserves  one  of  these  functional  units  will  be  placed 
in  the  corresponding  column. 

The  vector  memory  path  can  also  be  reserved  by  a vector 
instruction  and  is  shown  in  one  of  the  far  right  columns 
under  the  heading  "BSE",  which  stands  for  block  seguence 
flag.  This  flag  is  set  during  all  vector  memory  refer- 
ences 

7.  V.  Reg  - The  eight  Cray-1  vector  registers. 

Each  of  these  eight  columns  represent  the  reservation 
state  of  a Cray-1  vector  register.  Vector  registers 
are  reserved  by  the  vector  instructions  which  reference 
them  either  as  operand  or  result  registers.  The  activity 
tag  of  thdvissuing  vector  instruction  will  be  placed  in 
the  columns  corresponding  to  the  vector  registers  used 
by  the  instruction.  Operand  registers  arc  typically 
reserved  for  MAX  (VL,5)  clock  periods.  Result  registers 
are  typically  reserved  for  MAX{VL,5)  +FUT  + 2 clock 
periods,  whore  FLT  is  the  functional  time  of  the  vector 
unit  performing  the  vector  operation. 
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If  a subsequent  vector  instruction  requires,  as  an 
input  vector,  the  result  vector  of  a previous  vector 
instruction,  and  is  ready  to  issue  when  the  prior 
instruction's  first  result  arrives  at  it's  vector 
register  (the  first  result  will  arrive  in  FUT  f 2 
clock  periods  after  issue) , then  the  second  vector 
instruction  will  issue  only  at  the  clock  period  when 
this  first  result  arrives.  This  is  called  chaining 
and  the  clock  period  when  the  first  result  arrives 
is  called  chain  slot  time.  If  the  second  vector 
instruction  misses  chain  slot  time,  it  will  hold  issue 
until  all  results  of  the  first  instruction  have 
arrived  at  the  vector  register. 

If  the  second  vector  instruction  chains  to  the  first, 
the  activity  tag  of  the  second  instruction  will  replace 
the  tag  of  the  first  instruction  in  the  chained  vector 
register  column.  If  chain  slot  time  is  missed,  an 
asterisk  is  placed  in  the  result  register  field  at  the 
chain  slot  time  clock  period,  highlighting  chain  slot 
time . 

See  appendix-A  for  a summary  of  Cray-1  timing  informa- 
tion. 

MEMORY  BANKS  - The  Cray-1  rank  registers  and  memory  banks. 

This  portion  represents  the  Cray-1  scalar  memory  refer- 
ence access  network  and  the  16  Cray-1  memory  banks.  The 
memory  bank  cycle  time  of  the  Cray-1  is  four  clock  per- 
iods long.  Consequently^  memory  accesses  to  the  same 
bank  must  be  at  least  four  clock  periods  apart.  Two  scalar 
memory  references  which  could  address  the  same  bank  can 
issue  tv70  clock  periods  apart.  This  would  give  rise  to  a 
bank  conflict  which  is  resolved  by  the  scalar  rank  register 
access  network.  (I/O  access  to  main  memory  also  passes 
through  the  rank  registers) . The  four  columns  to  the  left 
of  the  16  memory  bank  columns  represent: 
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SCI  - Scalar  in  clock  period  one. 

RKA  - Rank  register  - A 
KKB  - Rank  register  - B 
RKC  - Rank  register  - C 

When  a scalar  memory  reference  issues,  it's  activity 
tag  is  placed  in  the  column  SCI,  It's  bank  address 
(lower  4 bits)  is  then  compared  to  the  bank  addresses 
in  rank  registers  A,  B and  C.  If  a bank  coincidence 
is  detected,  the  memory  address  waits  at  SCI  until  a 
clock  period  arrives  when  bank  coincidence  vanishes. 
Meanwhile  the  bank  addresses  in  the  rank  registers  are 
advanced  each  clock  period  to  the  next  rank  register. 

The  address  in  rank-C  advances  to  it's  target  memory 
bank  and  remains  latched  at  that  bank  for  four  clock 
periods.  On  the  fifth  clock  period,  the  memory  data 
is  gated  from  the  bank  into  the  SEC-DED  (single  error 
correction  - double  error  detection)  network.  Simul- 
taneously a new  memory  address  may  be  latched  onto  the 
bank  to  start  the  next  reference. 

While  the  address  is  waiting  in  SCI,  the  activity  tag 
of  the  issuing  instruction  is  placed  in  one  of  the  far 
right  columns  labeled  "STH",  which  means  storage  hold. 
While  a scalar  memory  reference  is  waiting  in  storage 
hold,  subsequent  scalar  memory  references  may  not  issue 
until  the  waiting  scalar  reference  leaves  the  storage 
hold  state.  This  means  that  two  scalar  memory  reference 
instructions,  accessing  the  same  bank,  may  issue  so  as 
not  to  block  subsequent  instructions  from  issuing.  But, 
if  a third  scalar  memory  reference  tries  to  access  the 
same  bank  as  the  first  two  references,  the  storage  hold 
state  of  the  second  scalar  reference  will  block  issue 
of  the  third  scalar  reference  which  blocks  all  instruc- 
tion issuing. 

As  the  memory  address  advances  through  the  rank  registers, 
the  activity  tag  of  the  issued  memory  reference  is  ad- 
vanced to  the  right.  When  the  tag  leaves  rank-C,  it 
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will  jump  to  its  tarqet  bank  and  remain  there  for 
four  clock  periods. 

Only  scalar  memory  references  place  tags  in  this 
section  of  CPACT.  Parcel  buffer  fetches,  B and  T- 
roqistor  transfers  and  vector  references  place  no 
tags  in  this  section.  They  do  affect  other  columns 
of  the  report,  though. 

9.  ARA  - The  A-register  access  path  busy  flag. 

There  is  a single  store  access  path  to  the  eight 
Cray-1  address  registers.  Each  clock  period,  one 
operand  may  be  stored  into  one  of  the  eight  A-registers 
via  this  path.  When  an  instruction  tries  to  issue,  if 
the  result  of  its  computation  would  make  use  of  the 
A-register  access  path  at  a future  clock  period  when 
the  path  is  already  reserved  for  use  by  a prior  instruc- 
tion, the  issue  will  be  held  until  the  next  clock  period. 
When  an  instruction  uses  the  access  path  its  activity 
tacj  will  appear  for  one  clock  period. 

10.  A.  Reg  - The  eight  Cray-1  address  registers. 

These  eight  columns  correspond  to  the  eight  Cray-1 
A-registers.  When  an  instruction  reserves  an  A-register 
its  activity  tag  will  appear  in  the  appropriate  column. 

11.  SRA  - The  S-register  access  path  busy  flag. 

This  flag  serves  the  same  function  for  the  eight  S-registers 
that  the  ARA  flag  does  for  the  A-registers. 

12.  S.  REG  - The  eight  Cray-1  scalar  registers. 

These  eight  columns  correspond  to  the  eight  Cray-1 
S-registers.  When  an  instruction  reserves  an  S-register 
its  activity  tag  will  appear  in  the  appropriate  column. 
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13. 


VM  - The  voctor  mask  busy  flaq. 

This  flaq  is  sot  when  a 003  instruction  (VM  Sj)  or 
a 175  instruction  (VM  Vj,C)  is  in  [iroqross.  The 
activity  taq  of  the  issuinq  instruction  appears  in 
this  column. 

14.  AOB  - AO  busy  flaq. 

The  A-roqister  conditional  branch  instructions,  010- 
013,  use  the  data  in  AO  to  make  branch  decisions. 

When  new  data  is  stored  AO  it  takes  two  additional 
clock  periods  to  validate  the  branch  test  flags.  While 
the  branch  test  flaqs  are  invalid,  the  A-register  con- 
ditional branch  instructions  will  hold  issue.  The 
activity  tag  of  the  instruction  storing  new  data  into 
AO  will  appear  in  this  column  until  the  branch  test 
flags  are  made  valid. 

15.  SOB  - SO  busy  flaq. 

The  same  comments  for  AOB  above  apply  here,  except 
the  S-reqister  conditional  branch  instructions  (014- 
017)  are  affected. 

lf>.  STB  - .Storage  hold  flag. 

The  activity  taq  for  a scalar  memory  reference,  whose 
address  experiences  a memory  bank  conflict  with  the 
rank  registers  is  placed  in  this  column  until  the  con- 
flict vanishes.  Subsequent  scalar  memory  references 
will  hold  issue  until  this  flag  clears.  See  the  memory 
banks  discussion  for  more  details. 

17.  073  - Vector  mask  read  inhibit  flag. 

Execution  of  a 003  instruction  (VM  Sj)  or  a 175  instruc- 
tion (VM  Vj,C)  will  cause  a 073  instruction  (Si  VM)  to 
hold  issue  until  the  003  or  17b  finishes.  The  activity 
tag  of  the  003  or  175  instruction  will  appear  in  this  column. 
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IR.  nC('.  - Tht!  parcel  buffer  chanqo  flnq. 

When  fche  next  instruction  parcel  to  enter  the  NIP 
(next  instruction  parcel)  retjister  is  not  in  the 
current  parcel  buffer,  duo  to  a branch  or  a buffer 
fall  throuqh,  this  column  will  be  taqqed  with  an 
asterisk  until  the  buffer  chanqe  is  completed.  This 
may  involve  a switch  to  one  of  the  other  parcel  buffers 
or  a parcel  buffer  memory  fetch  may  be  needed.  This 
flaq  causes  all  instructions  to  hold  issue. 

19.  FPA  - Fetch  fiause  flaq. 

This  flaq  is  used  to  triqqer  a parcel  buffer  fetch 
secjuence.  While  it  is  up^an  asterisk  appears  in  this 
column.  The  fetch  sequence  will  beqin  when  this  flaq 
is  clear. 

20.  BSF  - The  vector  memory  reference  block  setjuence  flaq. 

When  a vector  memory  reference  (176,177)  issues,  this 
column  will  be  set  with  the  activity  taq  of  the  issuinq 
instruction.  Subsequent  vector  memory  references  will 
hold  issue  until  this  flag  clears. 

21.  BTX  - The  B and  T register  block  transfer  flag. 

When  a B or  T register  block  transfer  instruction  (034- 
037)  issues,  its  activity  flag  will  appear  in  this 
column.  No  other  instruction  may  issue  while  a B or  T 
block  transfer  is  in  progress. 


2.5.3  CPACT  Examples 

To  illustrate  the  use  of  the  various  CPACT  report  fields  two 
examples  are  presented,  a scalar  memory  reference  example  and  a 
vector  example. 

2. 5. 3.1  Scalar  Example 

The  CPACT  report  shown  in  figure  2.5.5  (wide  and  narrow  versions) 
was  programmed  by  the  Cray-1  program  below 


S3 

53,0 

(B) 

A3 

33,0 

(C) 

AO 

103,0 

(D) 

SO 

51,0 

(E) 

S6 

50,0 

(F) 

S5 

47,0 

(G) 

JSZ 

24C 

(H) 

EX 

000 

(I) 

The  labels  to  the  left  are  the  parcel  addresses  of  the  associated 
instructions.  The  letters  in  parentheses  on  the  right,  are  the 
activity  tags  for  the  corresponding  instruction  as  assigned  by 
the  CPACT  report.  These  tags  will  be  used  to  refer  to  the  instruc- 
tions in  the  discussion  below. 

When  the  program  was  first  started  up,  no  instruction  parcels 
were  in  the  parcel  buffers  so  a fetch  was  required.  The  asterisk 
in  the  BCG  field  indicates  a buffer  change  in  process.  When  the 
first  instruction  parcel  (20A)  arrives  at  the  parcel  buffer,  two 
pass  instructions  ("<BLANK>")  issue  which  pull  the  parcel  through 
the  NIP  to  the  CIP  register.  Instruction  B issues  at  clock  period 
17  (CP  17)  and  its  activity  tag  appears  in  SCI  to  indicate  a scalar 
memory  reference  in  clock  period  one.  As  time  advances  the  B tag 
propagates  through  the  rank  registers  and  onto  memory  bank  13  octal 
(B  hexadecimal) . After  four  clock  periods  on  the  memory  bank,  the 
B tag  vanishes  and  appears  later  at  clock  period  27.  Here  the 
instruction  makes  use  of  the  S-register  access  path  so  the  memory 
data  can  be  stored  into  the  result  register,  S3  in  this  case. 
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Figure  2.5.5  - CPACT  Scalar  Example,  Wide  version 


CRAV-l  SIMULATOR  (UG088)  AUG  22/78  PAGE 


42 


Figure  2.5.5  - CPACT  Scalar  Example,  Narrow  Version 


During  the  time  period  from  instruction  issue  (CP  27)  until  the 
data  arrives  at  the  result  register  (CP  27) , the  B tag  appears 

in  the  S-register  3 column  showing  the  reservation  on  S3.  Because  < 

a scalar  memory  reference  instruction  is  a two  parcel  instruction, 
a BLANK  Will  issue  after  it.  This  is  true  for  all  two  parcel 
instruct  ions. 

When  scalar  reference  instruction  C issues,  a bank  conflict 
with  the  previous  instruction  is  detected  (address  33  and  53  are 

in  the  same  bank) . This  causes  the  reference  to  enter  the  storage  ' 

hold  state  until  the  conflict  vanishes,  meanwhile  the  C tag  appears 
in  the  STH  column.  Scalar  reference  instruction  D will  try  to  issue 
at  CP  21,  but  can't  because  the  storage  hold  flag  is  set.  This  i 

is  noted  by  the  underscore  beneath  the  C tag  in  the  STH  column. 

The  subsequent  four  scalar  references  (D,E,F  and  G)  all 
reference  available  memory  banks  and  issue  consecutively  without  > 

conflict.  Instruction  E loads  register  SO  which  is  needed  by 
branch  instruction  H to  decide  the  branch  outcome.  Even  though 
the  data  for  E arrives  at  CP  35,  two  more  clock  periods  are  required 
until  the  branch  condition  flags  become  valid.  While  the  branch 
flag  is  invalid  the  E tag  appears  in  the  SO  busy  column  (SOB) . Once 
the  SOb  flag  clears,  the  branch  instruction  issues  at  CP  38.  This 
branch  instruction  has  an  in-buffer  target  address.  While  the 
buffer  change  is  in  progress,  the  instruction  causing  the  change 
will  place  its  tag  in  the  BCG  column.  Once  the  change  is  complete, 

BCG  is  cleared  and  two  blanks  issue  to  load  the  CIP  register. 
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2. 5. 3. 2 Vector  Example 

This  example  illustrates  the  CPACT  report  with  vector  instruc- 
tions. All  vector  lenqths  are  seven.  Figure  2.5.6  is  the  CPACT 
report  generated  by  the  following  program: 


AO 

50 

(B) 

A1 

7 

(C) 

VL 

A1 

(D) 

.SI 

43,0 

(E) 

VO 

A0,1 

(F) 

VI 

A0,1 

(G) 

V2 

Vl+VO 

(H) 

VM 

V2,Z 

(I) 

SO 

VM 

(J) 

JSN 

4 0A 

(K) 

KX 

COO 

(M) 

As  in  the  scalar  example,  the  simulation  begins  with  a fetch 
sequence.  The  first  vector  instruction  (F)  loads  a vector  from 
memory  into  VO.  Rank-B  and  rank-C  busy  are  hold  issue  conditions 
for  this  vector  load  and  are  shown  underscored.  Once  the  vector 
load  issues,  it  places  its  tag  both  in  the  VO  busy  and  the  block 
sequence  flag  (BSF)  columns.  Vector  instruction  G is  also  a vector 
memory  load,  but  it  must  hold  issue  until  BSF  clears.  The  asterisk 
in  the  VO  column  at  clock  period  33  represents  the  chain  slot  clock 
period  for  instruction  F.  If  a vector  instruction  using  VO  as  an 
operand  was  placed  after  instruction  F and  it  meets  all  other  con- 
ditions at  CP  33,  it  will  issue  at  CP  33,  and  be  chained  to  F.  At 
CP  35  BSF  clears,  allowing  instruction  G (a  second  vector  memory 
load)  to  issue. 

Instruction  H is  a vector  integer  add  with  vector  registers  VI 
and  VO  as  operands.  Consequently,  the  busy  state  of  VI  and  VO  are 
hold  issue  conditions  for  H.  The  functional  unit  time  for  the  vector 
load  (G)  is  seven  clock  periods,  so  at  CP  44  , (issue  time)  + (functional 
unit  time)  + (2),  chain  slot  time  will  occur  and  the  vector  add 
issues.  The  tag  for  the  chaining  instruction  (H)  replaces  the  result 
tag  (G)  on  the  chained  register. 
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Figure  2.5.6  - CPACT  Vector  Example  "continued 


Instruction  I chains  in  a similar  way  to  the  result  of  instruc- 
tion H.  The  vector  mask  register  is  reserved  by  I and  its  tag  is 
placed  in  the  VM  column. 

Instruction  J will  hold  issue  until  the  073  inhibit  flag  clears. 
SO  is  used  as  data  for  branch  instruction  K which  does  a branch  out 
of  buffer,  causing  a fetch  sequence. 
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2 . 6 


Inconsistencies  with  the  Cray-1 

In  this  section,  simulator  behavior  that  is  known  to  be 
inconsistent  with  the  Cray-1  will  be  discussed. 

1)  The  simulator  does  not  simulate  Cray-1  I/O. 

2)  No  Cray-1  monitor  instructions  are  simulated. 

3)  The  Cray-1  exchange  mechanism  is  not  simulated. 

4)  Recursive  use  of  vector  registers  is  not  supported 
for  all  vector  instructions.  Recursive  use  is  avail- 
able for  instructions  141,  143,147  and  171. 

5)  Cray-1  integer  multiply  of  64  bit  operands  is  performed 
by  the  floating  point  multiply  unit  when  the  exponent 
field  is  found  to  be  zero.  This  behavior  is  not 
currently  supported  in  the  Cray-1  simulator.  Depending 
on  the  need  for  it,  it  may  be  added  later. 

6)  Integers  on  the  Cray-1  are  24  bits  wide.  Integers 
in  the  simulator  are  32  bits  wide. 

7)  The  simulator  floating  point  format  (IBM  360/370) 
differs  from  the  Cray-1  format.  (Sse  section  2.1.4) 

8)  The  071X2X  instruction  (Si  +AK)  produces  a normalized 
result  in  the  simulator.  Not  so  on  the  Cray-  1 . 

9)  Though  the  timing  of  sizeable  algorithms  has  been  close 
to  the  Cray-1  timing,  with  an  error  in  the  1/2%  range, 
it  is  not  exact.  The  source  of  this  timing  error  is 
presently  unknown. 
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3 . Examples 

This  section  is  devoted  to  the  discussion  of  two  sample 
Cray-1  programs.  The  first  example  illustrates  the  Cray-1 
simulator  in  standalone  mode  (no  user  main  program)  with  a user 
supplied  exit  processor  (CRAYEX) . The  second  example  illustrates 
calling  the  simulator  by  a user  supplied  main  program. 

3 . 1 Stand-alone  use  with  an  exit  processor 

This  example  illustrates  the  use  of  a user  supplied  exit 
processor.  Entry  to  the  exit  processor  is  triggered  by  executing 
a Cray-1  exit  instruction  (004ijk,  mnemonic  EX  ijk) . As  described 
in  section  2.4,  the  Cray-1  simulator  will  call  the  subroutine 
CRAYEX  with  both  the  ijk  field  of  the  instruction  and  the  Cray-1 
registers  as  arguments.  If  the  user  supplies  a CRAYEX  subroutine, 
this  routine  will  get  control  on  all  exit  instructions  which  have 
non-zero  ijk  fields. 

Appendix  H shows  a Fortran  listing  of  an  exit  processor  used 
to  perform  initialization  and  output.  The  Cray-1  program  that 
uses  this  exit  processor  is  a full  matrix  LU  decomposition  program. 

When  this  program  executes  an  "EX  1"  instruction,  CRAYEX  gets 
control  and  initializes  the  full  matrix  which  the  program  will 
decompose.  Register  A1  contains  a pointer  into  the  simulated 
Cray-1  memory  where  the  matrix  will  start.  Register  A2  contains 
the  size  of  the  square  matrix.  The  /MEMORY/  common  block  allows  the 
CRAYEX  routine  to  share  the  simulated  Cray-1  memory.  The  user  could 
equivalence  other  program  arrays  to  the  MEM  array  allowing  conven- 
ient array  sharing  between  the  user  Fortran  program  and  the 
simulator  memory. 

When  the  program  executes  an  "EX  2"  instruction,  CRAYEX  gets 
control  and  prints  both  the  matrix  size  and  the  run  time.  Register 
A7  contains  the  matrix  size  and  register  S7  contains  the  run  time 
as  sampled  by  the  Cray-1  program.  If  the  logical  variable  MOANS  is 
set  to  .FALSE.,  the  decomposed  matrix  will  be  printed.  This  is  a 
way  to  interface  the  Cray-1  program  with  Fortran  I/O. 

A typical  terminal  session  of  this  example  is  shown  on  the 
following  page. 
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4 , Simulator  Command  Descriptions 

This  section  describes  each  of  the  simulator  commands  in  detail. 

Each  command  may  be  abbreviated  and  the  minimum  acceptable  abbrevia- 
tion is  underlined.  Each  command  description  has  the  following  format 

(1)  Purpose  - The  function  of  the  command. 

(2)  Prototype  - The  parameter  syntax  for  the  command. 

(3)  Description  - A detailed  description  of  the  command. 

(4)  Examples. 

(5)  Error  Responses  - The  possible  error  responses. 

The  following  syntax  is  used  to  describe  the  commands: 

Upper  case  characters  must  appear  exactly  as  shown. 

Lower  case  characters  represent  generic  parameter  names  which 
must  bo  replaced  with  the  actual  parameters. 

Where  blanks  appear  one  or  more  blanks  must  appear. 

.Scjuaro  brackets  are  used  to  denote  optional  parameters. 

Ellipsis  notation  (...)  is  used  to  denote  the  repetition  of  a 
parameter  list. 

Vertical  bars  are  used  to  separate  parameter  alternatives. 

The  following  error  responses  apply  to  all  commands: 

Command  length  > 80  bytes  - 

All  commands  must  be  less  than  or  equal  to  80  characters 
in  length.  However,  a simulator  subroutine  call  may  pass 
a segmented  set  of  commands  whose  combined  length  may  not 
exceed  200  bytes.  Each  individual  command  though  may  not 
exceed  80  bytes. 

Command  string  terminator  "!"  not  found  - 

On  a simulator  subroutine  call  the  supplied  command  string 
did  not  contain  an  exclamation  point  within  the  first  200 
bytes . 
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4 . Simulator  Command  Descriptions 
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AT 

Command  Description 

Purpose  : To  set  an  AT  point  at  a selected  parcel  address 

Prototype  : AT  p-addr  [skip-count] 

; commands  to  process  when  the  AT  point  is  hit. 

KND 

Description : 

An  AT  point  is  set  at  the  specified  parcel  address.  This  address 
may  bo  in  modified  octal  format,  which  is  the  octal  word  address  follow- 
ed by  a parcel  code  A,B,C,  or  D,  or  may  be  a symbol  with  parcel  address 
attribute  defined  by  the  assembly  language  program.  After  the  AT  com- 
mand is  entered,  the  command  language  will  read  more  command  input. 

These  commands  are  written,  unprocessed,  into  a scratch  file.  To  ter- 
minate input,  enter  the  string  "END"  on  a single  line.  AT  points  set 
at  the  lower  parcel  of  a two  parcel  instruction  are  ignored. 

When  the  AT  point  is  hit,  the  scratch  file  will  be  opened  and 
subsequent  commands  read  from  that  file.  The  AT  file  is  terminated 
with  a RUN  command  which  will  resume  the  simulation  automatically. 

To  regain  control  when  the  AT  point  is  hit,  you  must  enter  the 
command  "USE  *MSOURCE*"  when  setting  up  the  AT  point. 

An  optional  skip-count  may  be  provided  when  first  setting  the 
AT  point.  This  is  a positive  decimal  number  which  indicates  the 
number  of  times  the  simulator  is  to  ignore  the  presence  of  this  AT 
point.  When  this  count  expires,  the  AT  point  will  be  recognized 
and  processed  as  above. 

When  the  AT  point  is  hit,  the  instruction  at  the  p-addr,  where 
it  is  set,  will  not  have  been  executed. 

Examples : 

AT  21A 

DlSPTJiY  S0  /\fl  M(33) 

END 

When  the  AT  point  is  hit,  registers  S0 , A0  and  memory 
location  33  octal  are  displayed. 
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AT  Command 


AT  245b  31 
D M(0) ,10 
USE  *MSOURCE* 

END 

An  AT  point  is  set  at  location  245B.  The  first  31  (decimal) 
times  the  instruction  is  executed,  the  AT  point  is  skipped.  Before 
the  instruction  is  executed  again  the  AT  point  takes  control  and 
displays  memory  locations  through  10  (octal) . Control  is  then 
given  to  the  user  terminal. 

AT  SUBRI 
change  A5  A7 
END 

This  has  the  effect  of  patching  a new  instruction  (A5  A7) 
at  location  with  label  .GUBRI  (i.e.,  A7  is  stored  into  A5)  . 

Error  Responses: 

Invalid  p-addr  - 

The  p-addr  is  unrecognizable  or  out  of  range  of  the  current 
memory  size. 

Undefined  Symbol  - 

The  symbol  specified  is  not  defined  in  the  current  (IDENT) 
module . 

Symbol  does  not  reference  a parcel  address  - 

The  symbol  has  word  address  or  value  attribute. 

Invalid  Skip  count  - 

Skip  count  unrecognizable  or  negative. 

No  room  for  more  Break  or  At  points  - 

Only  forty  Break  or  At  points  may  be  set  at  any  one  time. 

Break  or  At  point  already  set  here  - 

A Break  or  At  point  is  already  set  at  this  p-addr. 

Can't  open  AT  point  file  - 

The  command  language  was  unable  to  open  the  AT  file  for 
saving  the  commands.  This  could  occur  if  the  MTS  scratch 
file  character  is  changed  from  a minus  sign. 
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AT  COMMAND 


BREAK 


Corrunand  Description 

Purpose  : To  control  program  flo/^  by  setting  break  points. 

Prototype  : BREAK  p-addr  [skip-cnt] 

Description : 

A break  point  is  set  at  the  indicated  parcel  address.  The  parcel 
address  may  be  specified  in  a modified  octal  format;  an  octal  word  add- 
ress followed  by  a parcel  code  A,B,C  or  D,  or  may  be  a symbol  with  par- 
cel address  attribute.  An  optional  decimal  skip  count  may  be  specified 
and  will  cause  the  break  point  to  be  ignored  the  indicated  number  of  times 
The  effect  of  hitting  a break  point  is  equivalent  to  issuing 
the  command  "USE  *MSOURCE*".  Continuation  from  a break  point  is  accom- 
plished by  entering  a RUN  command.  An  end-of-file  condition  from  the 
terminal  (by  $ENDFIEE  or  control-c)  will  cause  the  command  stack  (see 
section  2.1.1)  to  be  popped.  This  allows  further  commands  to  come 
from  a prior  USE  command  or  a subroutine  call  command  string.  If  a 
RUN  command  is  entered  without  any  parameters,  the  remaining  issue 
limit  is  used  and  simulation  continues  with  the  broken  instruction. 

No  timing  information  is  lost  and  no  additional  time  is  required.  If 
a p-addr  is  specified  on  the  RUN  command,  an  instruction  buffer  fetch 
is  forced  and  this  will  alter  the  timing. 

When  the  break  point  is  hit,  the  broken  instruction  has  not  been 
executed.  Break  points  do  not  modify  the  Cray-1  memory,  so  they  may 
be  set  before  the  program  is  loaded.  A maximum  of  forty  BREAK  and  AT 
points  may  be  set.  Break  points  on  the  lower  parcel  of  a two  parcel 
instruction  are  ignored.  The  user  is  not  notified  of  this. 

Examples : 

BREAK  25A 
BR  13B  18 

B LABI 

Error  Responses: 

Invalid  p-addr  - 

The  p-addr  is  unrecognizable  or  out  of  range  of  the  current 
memory  size. 
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BREAK  command 


Invalid  skip  count  - 

Skip  count  unrecognizable  or  negative. 

No  room  for  more  BREAK  or  AT  points  - 

Only  forty  BREAK  or  AT  points  may  be  set  at  any  one 
t ime . 

Break  or  At  point  already  set  here  - 

A BREAK  or  AT  point  is  already  set  at  this  p-addr. 
Undefined  Symbol  - 

The  symbol  specified  is  not  defined  in  the  current  (IDENT) 
module. 

Symbol  does  not  reference  a parcel  address  - 

The  symbol  has  word  address  or  value  a'  -ibute. 


BREAK  command 
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CALCULATF 


Command  description 

Purpose  ; To  calculate  integer  offsets  for  memory  displacements 
Prototype  : CALCULATF,  < symbol ><op>< symbol ^ <op>^ symbol,> 

Descr ipt i on : 

The  integer  expression  is  evaluated  strictly  left  to  right.  Only 
four  operators  (<'op>)  are  allowed:  The  result  is  displayed 

in  decimal^  octal,  and  modified  octal.  The  operands  (<symbol>)  may  be 
replaced  with  any  pro-defined  or  user  defined  symbol  (see  the  DISPLAY 
command)  or  a constant  as  follows 

nnn  - for  an  octal  integer  constant 
O'nnn'-  for  an  octal  integer  constant 
D'nnn'-  for  a decimal  integer  constant. 

All  operands  are  interpreted  as  integers  with  only  the  lower  32  bits 
of  any  64  bit  (e.g.,  S-registers)  symbol  taking  part  in  the  computa- 
tion . 

Examples : 

CALC  O' 131 ’ +D’ 387 ' 

CALC  Al*D' 50'+  B.Rl 
CALC  M{32)+S4 

Error  Responses: 

Calc  unable  to  recognize  operator  - 
Bad  operator  seen  in  expression. 

Calc  unable  to  recognize  operand  - 

Bad  operand  or  invalid  pre-defined  symbol  seen  in  expression. 


CALCULATE  command 
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CHANGE 


Command  Description 

Purpose  : To  alter  Cray-1  storage  locations  in  the  simulator 
Prototype  : CHANGE  symbol  new-value 
Description : 

The  CHANGE  command  allows  any  program  accessible  storage  loca- 
tion in  the  Cray-1  simulator  to  be  changed.  The  symbol  parameter  may 
be  replaced  with  any  of  the  predefined  symbols  which  may  be  changed. 
See  the  DISPLAY  command  description  for  a list  of  the  valid  symbols. 

The  new-value  parameter  may  be  replaced  with  any  pre-defined 
symbol  or  one  of  the  following  constants: 


nnn  - 

for 

an  octal 

integer 

O' 

' nnn ' - 

for 

an  octal 

integer  constant 

D' 

' nnn ' - 

for 

a decimal 

integer  constant 

nnnDnn  - 

for 

a double 

precision  floating  point  constant. 

Examples : 

CHANGE  A1  O' 37  7' 

CFI  M(55)  2.5D0 

CH  V3(14)  2.32D27 

CH  M(50)  M(100) 

CH  B.BREG  T.TREG 

CH  M(l)  D' -1234567890123  ' 

Error  Responses: 

Change  unable  to  modify  symbol  - 

See  the  DISPLAY  command  for  a list  of  the  symbols  that 
may  not  be  changed. 

Change  unable  to  evaluate  symbol  - 

The  symbol  is  not  a legitimate  symbol 


CHANGE  command 
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CLEAR 

Command  description 

Purpose  : To  clear  break  points  and  at  points  4 

Prototype  : CLEAR  [p-addr  ...] 

I 

Description : 

The  CLEAR  command  is  used  to  remove  any  break  points  or  at 
points  that  have  been  previously  set.  If  no  parcel  address  para- 
meters are  specified,  then  all  break  and  at  points  will  be  cleared. 

If  one  or  more  parcel  addresses  are  supplied  as  arguments  to  the 
CLEAR  command,  then  the  break  or  at  points  set  at  these  locations 
will  be  cleared. 

Examples : 

CLEAR 

CL  21A  35C  74D  LOOPl 

Error  Responses: 

Invalid  p-addr  - 

The  p-addr  is  invalid  or  out  of  range. 

No  break  or  at  points  are  set  - 


Nothing  set  at  this  address  - 
Undefined  Symbol  - 

The  symbol  specified  is  not  defined  in  the  current  (IDENT) 
module . 

Symbol  does  not  reference  a parcel  address  - 

The  symbol  has  word  address  or  value  attribute. 
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CLEAR  command 


! 
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COMMENT 


Command  description 

Purpose  : To  provide  documentation  about  a simulator  session 
Prototype  : COMMENT  any  text  string 
Description : 

The  COMMENT  command  is  useful  for  documenting  a terminal  session 
or  for  generating  advisory  notices  from  AT  command  files  or  subroutine 
call-files.  With  AT  command  files,  the  commands  are  not  echoed  to  the 
output  device,  however,  COf^ENT  commands  will  echo.  Also,  on  a sub- 
routine call  to  the  simulator  COMMENT  commands  in  the  subroutine  com- 
mand string  will  echo  regardless  of  the  value  of  the  echo  parameter. 
Both  of  these  features  are  useful  to  remind  the  user  of  any  critical 
information . 

Examples : 

COMMENT  ANY  TEXT  STRING  MAY  BE  SUPPLIED. 

CO  THIS  AT  POINT  ALTERS  S3. 

CO  DON'T  FORGET  TO  SET  UP  LOCATION  34 

Error  Responses: 

None 
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COST 

Command  description 

Purpose  : To  print  out  processing  costs. 

Prototype  : COST 
Description ; 

The  COST  command  will  display  simulator  processing  cost 
information.  The  cost  figures  cover  the  period  from  program 
start-up  or  the  last  INIT  command  to  the  present. 

The  following  information  is  displayed: 


SIMOL^TITM  14<;T  IMIt  _ "^PTstV 


PTC 

CD')  TTw^' 

PRINT  I NO  pccrc; 
INSTP'JrTTPM 
iNSTPufTirv 
MOST  / rr^Y-1 


♦ 

♦ 


0 

0.  ’^7 

0,0 

P07.S 

0.^70 

o.n 


s = c 

LOW  Dpinpf^Y 

0 LINtS  '■>  PAPPS 
INS'^P.  /S'T. 

♦ / 1000  INSTP. 


1)  The  number  of  simulation  clock  periods. 

2)  The  number  of  instructions  issued. 

3)  The  host  (machine  on  which  simulator  is  running) 
cpu  time. 

4)  The  host  dollar  cost  and  job  priority. 

5)  The  printing  costs  (useful  for  CPACT  output) . 

6)  The  simulation  rate  in  issued  instructions  per  host 
cpu  second. 

7)  The  simulation  cost  in  dollars  per  thousand  issued 
instructions. 

8)  The  ratio  of  host  cpu  time  to  the  Cray-1  cpu  time 
using  the  number  of  simulation  clock  periods  multi- 
plied by  12.5  ns. 

Examples : 

COST 


Error  Responses: 
None . 
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CPACT 

Command  description 

Purpose  : To  control  the  generation  of  the  clock  period 
activity  report. 

Prototype  : CPACT  [ fdname  [COMPRESS  j ^COMPRESS]  [WIDE  | t^RROW]  ] 
Description : 

The  CPACT  command  enables  and  disables  the  clock  period  activ- 
ity report.  If  the  fdname  (MTS  file  or  device  name)  is  supplied, 
CPACT  is  enabled  and  the  report  is  directed  to  the  specified  fdname. 
If  no  fdname  is  supplied  CPACT  is  disabled.  It  should  be  noted  that 
enabling  CPACT  will  increase  the  simulation  cost  over  the  non-timing 
simulation  mode  by  roughly  a factor  of  forty  to  fifty.  If  timing 
is  off  (see  the  SET  command)  when  CPACT  is  enabled,  CPACT  will  turn 
TIMING  on. 

Since  one  line  of  output  is  generated  for  each  clock  period  of 
simulation  time,  quite  a bit  of  output  can  be  generated  fairly  fast. 
To  keep  cost  to  a minimum,  under  MTS,  CPACT  should  be  sent  directly 
to  *PRINT*.  If  COMPRESS  is  specified,  identical  hold  issue  lines 
v/ill  be  suppressed.  COMPRESS  is  the  default. 

The  normal  CPACT  report  is  suitable  for  printing  on  132  column 
printers.  If  NARROW  is  specified  or  fdname  corresponds  to  the  user's 
terminal,  the  report  will  be  condensed  to  80  columns. 

The  report  produced  by  CPACT  is  described  in  more  detail  in  sec- 
tion 2.5. 

Examples : 

CPACT  *PRINT* 

CP 

CP  *SINK*  NARROW 
Error  Responses: 

CPACT  already  enabled  - 

CPACT  with  an  fdname  was  given  when  CPACT  was  already 
enabled . 

CPACT  command 
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DEFINE 

Command  Description 
Purpose  : To  define  a new  symbol 
Prototype  : DEFINE  symbol  constant  I , W | , P | , V] 

Descript  ion : 

The  DEFINE  command  adds  a new  symbol  to  the  symbol  table. 

It  may  then  be  used  by  other  simulator  commands. 

The  value  of  the  symbol  will  be  the  constant.  If  the  con- 
stant is  octal,  the  type  of  the  symbol  will  be  word  address. 

If  the  constant  is  modified  octal,  the  type  will  be  parcel 
address . 

The  default  type  may  be  overridden  by  appending  ,w  , P or  , V to 
the  constant.  If  this  is  done,  the  symbol  will  be  defined  as  type 
word  address,  parcel  address,  or  value,  respectively. 

Examples : 


DEFINE 

START 

22B 

DEF 

ARAYl 

100 

DEF 

BNAME 

77, V 

Responses : 

Invalid 

constant 

Invalid 

type  - 

type  was  not  V’,  P,  or  V. 

Duplicate  symbol  - 

the  symbol  is  already  defined. 


65 


DEFINE  command 


DISPLAY 


Command  Description 


Purpose 


Prototype 


To  allow  the  user  to  examine  the  registers  and  memory  of 
the  simulated  Cray-1. 

DISPLAY [ 0fmt ] symbol [, length]  ... 


Description : 

The  DISPLAY  command  provides  a faciltiy  through  which  the  user  may 
examine  Cray-1  registers  and  memory.  The  location  to  be  displayed  (sym- 
bol) is  represented  by  any  of  the  predefined  symbols  shown  in  the  table 
below,  or  a user  defined  symbol.  Subsequent  contiguous  locations  can  be 
displayed  by  providing  a length  parameter,  separated  from  the  symbol  name 
by  a comma.  The  length  parameter  may  be  a symbol  name  (e.g.,  VL)  or  a 
decimal  integer  constant.  Also  noted  in  this  table  is  whether  or  not  the 
CHANGF  command  will  alter  the  symbol.  DISPLAY  output  may  be  aborted  by 
a keyboard  attention. 

Sach  symbol  has  a default  display  format  associated  with  it.  The 
default  may  be  overridden  for  all  symbols  on  the  command  by  appending 
display  format  codes  (fmt)  to  the  command  name.  The  format  codes  string 
is  prefixed  with  These  format  codes  are  defined  as  follows. 


FORMAT 

Code  Meaning 

E Floating  pt. 

F Fixed  pt. 

0 Octal 
P Parcel 

M Modified  Octal 

1 Instruction 
S Symbolic 


DISPLAY 


64  ' 

24  ' 

16' 

Operand 

Operand 

Operand 

Floating  pt. 

N.  A. 

N.A. 

64'  integer 

24'  integer 

16'  integer 

64 ' octal 

24'  octal 

16'  octal 

4 octal  parcels 

N.A. 

16'  octal 

4 M.  octal  parcels 

24 ' M . octal 

16'  M.  octal 

4 Instr.  Mnemonics 

N.A. 

Instr.  Mnemo) 

Symbol 

Symbol 

Symbol 

User  defined  symbols  are  those  symbols  defined  by  the  assembly  lan- 
guage program  and  contained  in  a relocatable  load  module.  These  symbols 
may  be  one  of  three  types:  parcel  address,  word  address  or  value.  A 
parcel  address  symbol  is  treated  as  a 16  bit  operand,  and  names  a parcel 
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memory  location.  A word  address  symbol  is  treated  as  a 64  bit  operand, 
and  names  a word  memory  location.  A value  symbol  may  be  used  to  name 
an  A,B,S,T  or  V reqister. 

The  only  user  defined  symbols  which  may  be  referenced  are  those 
in  the  current  module.  See  the  IDENT  command. 

For  the  operand  - format  code  combinations  which  are  not  applicable, 
no  value  will  be  displayed. 


Pre-Defined  Symbol  Table 


Symbol 

name 

Cray-1 

storage 

location 

Region 

length 

allowed? 

Change 

allowei 

Vn (elt) 

Vector  registers 

Yes 

Yes 

Sn 

Scalar  registers 

Yes 

Yes 

Tnn 

T-registers 

Yes 

Yes 

An 

Address  registers 

Yes 

Yes 

Bnn 

B-registers 

Yes 

Yes 

M (addr ) 

Memory,  words 

Yes 

Yes 

IM (p-addr) 

Memory,  parcels 

Yes 

Yes 

P 

P-register 

No 

Yes 

CIP 

Current  instruction  parcel 

No 

No 

NIP 

Next  instruction  parcel 

No 

No 

LIP 

Lower  instruction  parcel 

No 

No 

VM 

Vector  mask  register 

No 

Yes 

VL 

Vector  length  register 

No 

Yes 

RTC 

Real  time  clock 

No 

Yes 

XP 

Exchange  package 

No 

No 

IBn (elt) 

Instruction  buffers 

Yes 

No 

n or  nn  is  a register  number 

elt  is  an  element  index  within  a register 

addr  is  a word  address,  may  be  an  expression 

p-addr  is  a parcel  address,  may  be  an  expression 
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Examples ; 

DISPLAY  A1  S3  A.LOOPCNTR  V.ROWl  SUBRTNl 
DOO  A0,8 

DOPI  IM(p) » 10  main,  LEN$ 

DOEFO  V0(0),VL 

Error  responses: 

Invalid  format  code  - 

see  format  code  list  on  page  64. 
Invalid  symbol 
Invalid  integer 


DUMP 

Command  description 


Purpose  : To  display  the  contents  of  all  data  areas  of  memory. 

’ ' Prototype  : DUMP  [modulo-name] 

* Description; 

The  DUMP  command  displays  the  contents  of  memory  addressed  by 
all  symbols  of  type  word  address.  The  memory  locations  are  displayed 
in  floating  and  fixed  formats. 

If  a module-name  is  specified,  only  the  module  with  corresponding 
I DENT  name  is  dumped. 

Examples : 

DUMP 

DU  SUBRI 

Error  Responses: 

Module  not  loaded  - 

The  specified  module-name  is  not  the  name  of  any  loaded  module. 


k 


DUMP  conunand 
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ENDFILE 

Command  description 

Purpose  : To  signal  an  end-of-file  condition  to  a USE  command 
Prototype  : ENDFILE 
Description : 

This  command  terminates  the  effect  of  the  current  USE  command. 
It  pops  the  command  stack  causing  input  to  be  read  from  the  previous 
source.  See  section  2.1.1  for  more  information  about  command  input 
control.  If  a USE  command  is  not  in  progress,  ENDFILE  is  a no-op. 
That  is,  ENDFILE  will  not  terminate  a call-file  or  an  AT-file. 

Examples : 

ENDFILE 

E 

Error  Responses: 

None. 


HELP 


Conunand  description 

Purpose  : To  provide  on-line  information  about  command  syntax 
and  function. 

Prototype  : HELP  [cmd-name 
Description : 

The  HELP  command  takes  as  parameters  the  simulator  command 
names.  For  each  command  name  (cmd-name)  given,  a brief  description 
is  printed.  A keyboard  attention  may  be  used  to  abort  the  HELP 
output.  If  no  command  name  is  provided  a list  of  the  legal  commands 
is  printed. 

Examples : 

HELP  DISPLAY  CHANGE 
HELP 
H HELP 

Error  Responses: 

I can't  help  you  - 

The  file  containing  the  HELP  responses  doesn't  exist  or 
couldn't  be  accessed. 

Error  during  HELP  - 

An  error  occurred  during  I/O  to  the  output  device. 


HELP  command 


I DENT 


Command  description 

Purpose  : To  determine  the  subset  of  user  defined  symbols 

which  may  be  referenced  by  other  commands. 

Prototype  : I DENT  module-name 
Description  : 

Relocatable  modules  loaded  by  the  simulator  contain  the  defin- 
itions of  all  symbols  defined  in  the  assembly  language  program.  Since 
assembly  programs  can  be  assembled  and  loaded  independently,  these 
symbols  may  not  be  unique.  Only  those  symbols  defined  within  a single 
module  may  be  used  at  any  given  time. 

The  name  field  on  the  IDENT  command  must  be  the  name  contained  on 
the  IDPJNT  record  of  one  of  the  loaded  modules.  Only  the  symbols  con- 
tained in  that  module  will  be  available  for  use  by  other  commands. 

Examples : 

TDENT  MAIN 
fD  SUnROUTN 

Error  Responses: 

Module  not  loaded  - 

The  name  specified  did  not  appear  on  the  IDENT  card  of  any 
loaded  module. 
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INIT 


Command  description 


Purpose 


To  re-initialize  the  simulator 


Prototype  : INIT 


Description : 

The  INIT  command  allows  re-initialization  of  the  simulator 
state  between  runs  of  a proqram.  It  has  the  following  effects: 


1. 

2 . 

3. 

4. 

5. 

INIT  will 


All  timing  information  is  initialized. 

All  report  information  in  initialized. 

The  simulator  state  is  cleared. 

The  CPU  clock  is  cleared. 

The  CiP,  NIP  and  instruction  buffers  are  invalidated, 
not  alter  the  A, B, S , T, V, VL, MODE y P and  VM  regioters  or 


memory 


Examples 

INIT 

I 


Error  Responses: 
None . 
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LOAD 


Command  description 

Purpose  : To  load  programs  into  the  simulated  CRAY-1  memory. 

Prototype  : LOAD  [s.a.]  fdname  ... 

Description ; 

The  LOAD  command  opens  the  file  or  device  (fdname)  and  reads 
one  or  more  load  modules  from  it.  The  load  modules  may  be  absolute 
or  relocatable.  See  appendix  K for  a discussion  of  load  module 
formats . 

Absolute  load  modules  are  loaded  at  the  address  specified  in 
the  module.  The  octal  starting  address  (s.a.),  if  specified,  pre- 
ceding the  fdname,  is  ignored. 

Relocatable  modules  are  loaded  at  the  first  available  16  word 
boundary,  unless  an  octal  starting  word  address  (s.a.)  is  specified. 
Modules  will  be  relocated  and  linked  by  the  loader. 

Examples : 

LOAD  OBJ 
LOAD  30  FILEl 
LOAD  * SOURCE* 

Error  responses 

Unresolved  externals  exist  - 

A relocatable  module  references  a module  which  is  not  loaded. 
The  user  will  be  prompted  for  more  loader  input. 
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MAP 

Command  Description 

Purpose  : To  display  the  locations  of  all  loaded  modules 

Prototype  : MAP  (XRKF] 

Description  : 

The  MAP  command  will  display  the  names  of  all  loaded  modules  and 
their  true  addresses  in  CRAY-1  memory. 

If  XREF  is  specified,  a two  part  cross  reference  table  will  be 
produced.  The  first  i>art  lists  each  module  and  the  other  modules  it 
references.  The  second  part  lists  each  module  and  all  modules  which 
reference  it. 

Examples : 

MAP 

MAP  XREF 
MA  X 

Error  responses: 

None . 


I 
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MAP  COMMAND 


MTS 


Command  description 

Purpose  : To  provide  a command  interface  between  the 
simulator  and  MTS. 

Prototype  : 1.  MTS  [mts-command] 

2.  $mts-command 

Description : 

The  MTS  command  allows  the  user  to  pass  commands  to  MTS 
without  stopping  the  simulator.  In  the  first  prototype  an 
optional  MTS  command  may  be  supplied.  Return  is  made  to  MTS 
with  MTS  processing  the  command.  The  user  may  restart  the 
simulator  with  the  $RRSTART  MTS  command.  The  second  prototype 
allows  the  issuing  of  a one-shot  MTS  command.  That  is,  the 
command  is  passed  to  MTS  but  control  returns  to  the  simulator 
automatically  when  the  command  finishes.  Any  command  input 
to  the  simulator  that  is  prefixed  with  a dollar  sign  is  treated 
as  one-shot  MTS  command. 

Examples : 

MTS 

M DIS  VMSIZE 
$EMPTY  -RPT 
$EDIT  TRIDEC 
$SDS 

Error  Responses: 

None . 
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REMOVE 

Command  Description 

Purpose  : To  remove  a symbol  from  the  simulator's  symbol 
table . 

Prototype  : REMOVE  symbol 
Description : 

The  symbol  specified  is  removed  from  the  simulator  tables. 
No  further  command  language  references  to  it  will  be  allowed, 
unless  it  is  redefined. 

Examples : 

REMOVE  .START 
REM  ARRAY 1 

Error  Response: 

symbol  is  not  defined. 


RETURN 

Command  description 

Purpose  : To  allow  the  simulator  to  return  to  it' s caller. 
Prototype  : RETURN 
Description : 

The  RETURN  command  is  used  to  force  the  simulator  to  return 
to  it's  caller.  Normally,  when  the  simulator  is  called  as  a sub- 
routine, the  CRAYl  interface  subroutine  will  automatically  place 
a RETURN  command  at  the  end  of  the  call-file  after  all  other  user 
commands.  As  the  call-file  is  processed  this  RETURN  will  eventually 
be  executed.  To  cause  an  early  return  to  the  caller,  the  user  may 
issue  a RETURN  command,  thereby  skipping  the  remaining  commands  in 
the  call-file. 

A RETURN  command  issued  when  running  the  simulator  stand-alone 
is  equivalent  to  a STOP  command. 

Examples : 

RETURN 

Error  Responses: 

None. 


( 

) 


Conunand  description 


Purpose  : To  begin  simulation  of  a Cray-1  program 
Prototype  : RUN  [p-addr]  [#issue-limit] 

Description : 

The  RUN  command  is  the  only  simulator  command  that  actually 
begins  the  simulation. 

The  optional  starting  parcel  address  (p-addr)  is  the  modified 
octal  address  » Or  symbol  with  parcel  address  attribute,  of  the  first 
instruction  to  execute,  if  no  p-addr  is  supplied,  execution  begins  with 
the  parcel  pointed  to  by  the  Cray-1  P address  register.  If  a p-addr  is 
specified,  an  instruction  buffer  fetch  sequence  v/ill  always  be  forced 
even  if  the  instruction  is  already  in  a parcel  buffer.  If  no  p-addr  is 
supplied,  a buffer  fetch  sequence  will  occur  only  if  the  P register 
points  to  an  out  of  buffer  instruction.  (Note:  changing  the  P register 
by  the  CHANGE  command  always  forces  a buffer  fetcn  sequence.)  Conse- 
quently, if  no  p-addr  is  given  the  program  timing  will  be  unaffected  by 
whatever  previously  halted  the  simulation  (a  break  point,  an  attention, 
etc . ) . 

The  optional  issue  limit  parameter  can  be  used  to  control  the 
simulation  by  limiting  the  number  of  instructions  that  mav  issue. 

This  must  be  a positive  decimal  number  prefixed  by  a pound  sign  and 
is  used  to  prevent  run  away  programs  or  to  allow  single  stepping 
through  a program.  If  an  issue  limit  is  not  provided  the  remainder 
of  a previous  issue  limit  is  used  or  if  no  remainder  is  left,  a de- 
fault value  of  1000  is  supplied. 

A single  keyboard  attention  may  be  issued  to  stop  the  simulation. 
The  simulation  will  be  halted  in  such  a way  that  no  information  is 
lost.  The  issue  of  a RUN  command  will  continue  the  simulation  as  if  it 
had  never  been  interrupted. 

Two  keyboard  attentions  may  be  issued  to  exit  to  MTS  if  a single 
one  doesn't  stop  the  simulation.  This  could  occur  if  either  the 
simulator  or  a CRAYEX  routine  were  looping.  See  section  2.4  about 
the  CRAYEX  dispatcher. 
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There  are  many  conditions  that  can  arise  to  stop  the  simula- 
tion. Normally,  a run  command  will  terminate  when  an  EX  instruction 
(004000)  is  executed  and  this  is  the  usual  way  to  stop  a program. 
Other  common  conditions  that  stop  simulation  are  break  points,  at 
points,  issue  limit  expired  or  a keyboard  attention.  The  except- 
ional conditions  that  halt  simulation  are  discussed  in  section  2.2 

Examples : 

RUN  23C  #5000 

RUN 

R MAINPROG 

R#1  - to  single  step. 

Error  Responses: 

RUN  unable  to  evaluate  expression  - 

Either  the  p-addr  or  the  issue  limit  was  invalid. 

The  offending  parameter  is  printed. 
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RUN  command 


SET 


Command  Description 

Purpose  : To  permit  alteration  of  user  setable  switches 

Prototype  : SET  lhs=rhs 

Description: 

The  SET  command  allows  the  user  to  control  some  of  the  feat- 
ures of  the  simulator.  Each  parameter  is  composed  of  a left  hand 
side  (Ihs) , an  equal  sign  and  a right  hand  side  (rhs) . The  left 
hand  sides  are  the  keyword  names  and  the  right  hand  sides  are 
the  new  keyword  values.  The  table  below  lists  the  legitimate 
left  hand  sides  followed  by  a discussion  of  each  one. 


Keyword 

Keyword  values 

Default 

EFI 

ON,  OFF 

ON 

^SLIMIT 

positive  integer 

1000 

MACHINE 

CRAYl,  CRAYl-A 

CRAYl 

MEMORY 

SECDED,  PARITY 

SECDED 

OUTPUT 

any  fdname 

*MSINK* 

TIMING 

ON,  OFF 

OFF 

TITLE 

a character  string 

blank 

not  exceeding  35  bytes 


EFI  default:  ON 

The  EFI  (enable  floating  point  interrupt)  keyword  allows 
user  control  of  interrupts  caused  by: 

1)  Exponent  overflow 

2)  Exponent  underflow 

3)  Floating  point  division  by  zero. 

If  EFI  is  ON,  the  above  three  conditions  will  stop  simulation.  If 
EFI  is  OFF,  these  three  conditions  will  be  ignored  when  they  occur. 
When  the  simulator  starts  up  EFI  is  on  by  default.  EFI  is  a mode 
bit  in  the  Cray-1  mode  register  and  the  Cray-1  instructions  EFI  and 
DFI  can  also  set  or  clear  this  bit. 
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ISLIMIT 


Default;  1000 

The  ISLIMIT  keyword  allows  the  user  to  change  the  default 
instruction  issue  limit.  If  no  issue  limit  is  specified  on  the 
RUN  command  and  no  remaining  issue  limit  exists  from  previous 
run  commands,  the  default  instruction  issue  limit  is  used.  A 
positive  decimal  integer  must  be  specified  on  the  right  hand 
side.  When  the  simulator  starts  up  this  keyword  has  a default 
value  of  1000.  Setting  ISLIMIT  to  one  is  useful  for  single  stepping 
through  the  program. 

MACHINE  Default;  CRAYl 

This  keyword  is  intended  for  selecting  the  use  of  experimental 
architectural  modifications  to  the  Cray-1  simulator.  The  current 
legitimate  keyword  values  are  "CRAYl"  and  "CRAYl-A" , with  default 
being  "CRAYl".  When  CRAYl  is  selected,  normal  Cray-1  timing  is 
in  force.  Currently,  selecting  CRAYl-A  invokes  only  one  Cray-1 
architectural  modification;  that  of  improved  memory  bandwidth. 

With  CRAYl-A,  the  data  rates  (in  words  per  clock  period)  for  block 
transfers  (instructions  034-037,  176,  177)  to  and  from  main  memory 
are  shown  in  the  table  below.  These  data  rates  are  a function  of 


the  address  increment  (K)  used 

037,  (Ak)  for  176,  177) . 

K mod  16  Data  Rate  (wds/cp) 

by  the  block 

K mod  1 6 

transfer  (one  for  034- 

Data  Rate  (wds/cp) 

0 

.25 

8 

.5 

1 

4 

9 

4 

2 

2 

10 

2 

3 

4 

11 

4 

4 

1 

12 

1 

5 

4 

13 

4 

6 

2 

14 

2 

7 

4 

15 

4 

When  CRAYl-A  is  selected,  chaining  a vector  arithmetic  instruction 
off  of  a vector  memory  load  (176)  is  disallowed.  This  is  because 
of  the  possible  imbalance  in  data  rates  between  the  two  instructions. 
In  general,  this  should  not  be  a hardship  since  a reordering  of 
vector  instructions  usually  allows  one  to  stagger  the  vector  memory 
references  to  run  in  parallel  with  the  arithmetic  vector  instruc- 
tions. 
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SET  command 


MEMORY  Default:  SECDED 

The  first  Cray-1  built  by  Cray  Research  Inc.  has  a memory 
which  is  protected  by  parity  checking  only.  This  was  later  found 
to  be  unsatisfactory  and  subsequent  machines  were  built  with  SEC- 
DED (single  error  correction  - double  error  detection)  memory 
protection.  By  introducing  SEC-DED  on  the  memory,  the  access 
path  to  memory  is  one  clock  period  longer  than  on  the  parity  checked 
memory.  This  timing  difference  is  user  selectable  in  the  simulator. 

By  setting  MEMORY  to  the  value  PARITY  (e.g.,  SET  MEMORY=PARITY) , 
timing  with  the  parity  checked  memory  is  possible.  When  the  simu- 
lator starts  up  the  default  memory  timing  is  SECDED. 

OUTPUT  Default:  *MSINK* 

The  OUTPUT  keyword  controls  the  file  or  device  to  which  the 
simulator  sends  all  normal  output  (i.e.,  not  prompts  or  error  mess- 
ages, which  always  go  to  the  terminal) . Normal  output  includes  in- 
formational messages,  DISPLAY,  HELP,  and  STAT  output.  When  the 
simulator  starts  up  OUTPUT  is  set  to  *MSINK*  (the  terminal) . With 
the  SET  command  OUTPUT  may  be  set  to  another  file  or  device.  A key- 
board attention  will  switch  the  output  back  to  *MSINK*  automatically. 

TIMING  Default:  OFF 

The  TIMING  keyword  controls  simulator  resource  timing.  If 
TIMING  is  off,  only  the  results  of  instruction  execution  are  com- 
puted. If  TIMING  is  on,  resource  timing,  reservation  and  issue  con- 
straints are  simulated.  By  default,  TIMING  is  off  when  the  simulator 
starts  up.  Setting  TIMING  on  increases  the  simulation  cost  by  a 
factor  of  eight  to  ten.  TIMING  may  also  be  enabled  and  disabled 
by  the  ERT  and  DRT  instructions  respectively.  See  section  5 for 
more  explanation  on  ERT  and  DRT.  Timing  must  be  enabled  to  produce 
the  CPACT  report.  However,  the  enabled  or  disabled  state  of  CPACT 
is  independent  of  the  setting  of  TIMING.  That  is,  turning  TIMING 
on  and  off  won't  affect  the  enabled  state  of  CPACT.  However,  the 
CPACT  report  is  not  producted  while  TIMING  is  disabled,  but  it  will 
be  resumed  when  TIMING  is  turned  back  on. 
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SET  command 


TITLE 


Default:  blank 


The  TITLE  keyword  allows  changing  the  title  field  on  the  CPACT 
and  STAT  reports.  This  title  is  always  set  when  a LOAD  command 
processes  the  title  field  on  the  load  module  header  record.  If  no 
title  field  is  provided  TITLE  is  set  to  blanks.  The  right  hand 
side  of  the  TITLE  keyword  may  be  a string  up  to  35  characters 
long.  No  delimiters  are  required  and  the  string  may  contain  em- 
bedded blanks.  All  35  characters  will  appear  on  the  CPACT  report 
but  only  the  first  25  characters  are  printed  on  the  STAT  report. 

Examples : 

SET  TIMING=ON  OUTPUT=-FILE  EFI=OFF 

SET  TITLE=THE  CPACT  TITLE  TIM=OFF  OUT=*PRINT* 

SET  MEMORY=PARITY  ISLIMIT=2500  MACHINE=CRAYl-A 

Error  Responses: 

No  equal  sign  found  - 
Invalid  left  hand  side  - 

The  keyword  on  the  left  hand  side  is  unrecognizable. 
Invalid  right  hand  side  - 

The  right  hand  side  must  be  among  the  possible  right 
hand  sides  for  a given  left  hand  side. 

SET  will  have  processed  all  keyword  assignments  up  to  the 
point  of  the  error.  An  error  aborts  the  rest  of  the  SET  command. 
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SET  command 


STAT 


Command  description 

Purpose  : To  print  out  Cray-1  resource  usage  statistics 
Prototype  : STAT  [FULL] 

Description : 

This  command  will  print  on  the  current  output  device  a sum- 
mary report  of  Cray-1  resource  usage.  This  report  is  composed 
of  the  following  three  sections: 

1.  Vector  Usage  counts 

2.  Floating  point  result  counts 

3.  Data  traffic  counts 

The  vector  usage  counts  section  is  a timing  measure  of  the 
program's  vector  use  of  the  Cray-1  vector  functional  units.  The 
data  for  this  section  is  only  collected  when  TIMING  is  ON  . If 
TIMING  is  OFF  when  the  STAT  command  is  issued,  this  section  will 
not  be  printed  since,  most  likely,  it  would  all  be  zero. 

The  floating  point  result  counts  section  is  a measure  of  the 
program's  use  of  floating  point  computation.  Floatinq  point  add- 
ition, multiplication  and  reciprocal  operations  are  tabulated  for 
both  vector  and  scalar  instructions.  The  data  for  this  section 
is  alwavs  collected  regardless  of  the  state  of  TIMING. 

The  data  traffic  counts  section  is  a measure  of  the  data 
(operands  & results)  flow  throughout  the  Cray-1.  Each  major  Cray-1 
data  path  is  illustrated  on  a figure,  that  is  part  of  the  report, 
along  with  the  amount  of  traffic  on  each  path.  Also  included  in 
this  section  are  some  calculations  of  ratios  and  percentages 
based  on  the  data  traffic  statistics.  The  formulas  used  for  each 
calculation  are  printed  beyond  column  80  of  the  line  containing 
the  calculated  number.  Normally,  these  formulas  won't  appear  on 
an  80  column  terminal,  but  will  be  printed  if  the  STAT  output  is 
diverted  (via  SET  OUTPUT=*PRINT* ) to  the  line  printer. 

The  data  for  this  section  is  always  collected  regardless  of 
the  state  of  TIMING.  This  section  will  not  be  printed  unless  the 
FULL  option  is  specified  on  the  stat  command.  The  INIT  command 
will  reinitialize  the  STAT  data  collection. 


STAT  command 
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This  discussion  is  intended  as  a brief  conunand  description. 
For  a detailed  discussion  of  both  the  STAT  and  CPACT  reports 
see  section  2.5. 

Examples : 

STAT 

STAT  FULL 

SET  OUTPUT=*PRINT* 

STAT 

SET  OUTPUT=*MSINK* 

Error  responses: 

Extraneous  parameter  on  STAT  command  - 

This  occurs  if  FULL  is  misspelled  or  improperly  abbrev- 
iated. 
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STAT  command 


STOP 


Command  description 

Purpose  : To  terminate  execution  of  the  Cray-1  simulator. 
Prototype  : STOP 
Description : 

The  STOP  command  terminates  execution  of  the  simulator,  re- 
leases virtual  memory  used  by  the  simulator  and  returns  to  MTS. 

Examples : 

STOP 

Error  Responses: 

None . 
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STOP  command 


USE 


Conunand  description 

Purpose  : To  switch  the  command  input  stream  to  a file. 

Prototype  : USE  fdname  [NOECHO] 

Description : 

This  command  allows  the  user  to  put  a long  or  frequently  used 
command  sequence  in  a file  and  have  the  simulator  process  those 
commands  from  that  file.  The  fdname  parameter  is  replaced  with  the 
name  of  an  MTS  file  or  device  from  which  the  simulator  will  read 
subsequent  commands.  Commands  read  from  the  file  will  automatically 
echo  onto  the  current  output  device  unless  the  optional  NOECHO 
parameter  is  specified. 

Any  end-of-file  condition  or  an  ENDFILE  command  will  terminate 
thf  USE  command.  This  will  pop  the  command  stack  causing  input  to 
resume  with  the  previous  source.  The  command  stack  is  fifteen  levels 
deep,  allowing  the  user  to  nest  USE  commands  as  desired. 

A keyboard  attention  may  be  used  to  abort  any  and  all  outstanding 
USE  commands  by  resetting  the  command  stack.  This  will  cause  the 
terminal  to  be  current  input  device. 

Examples: 

USE  DISPFILE 

USE  *MSOURCE*  - to  read  from  the  terminal 
U CMOS  NOECHO 

Error  Responses: 

USE  command  unable  to  open  file  - 

The  given  fdname  doesn't  exist  or  access,  not  allowed. 

FDUB  command  stack  Overflow- 

Attempt  to  nest  more  than  15  USE  commands. 
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USE  command 


5. 


Cray  - 1 Simulation  Cost 

This  section  will  discuss  the  three  different  levels  of 
simulation  and  compare  their  costs.  The  three  simulation  levels 
in  increasing  order  of  expense  are: 

(1)  Non- timing  simulation.  Only  results  are  computed. 

(TIMING  = OFF) 

(2)  Timing  simulation  and  result  computation.  (TIMING  = ON) 

(3)  Resource  activity  trace  simulation.  (CPACT  report) 

To  compare  the  relative  costs  of  the  different  levels,  we  took 
a full  matrix  LU  decomposition  algorithm  (with  a 10  x 10  matrix)  and 
simulated  it  at  all  three  levels.  In  addition,  we  wrote  a Fortran 
version  and  ran  it  directly  on  the  host  machine  (an  Amdahl  470/V6) 
used  by  the  simulator.  The  results  of  the  comparison  are  shewn  in 
Table  1. 


The  four  comparison  tests  are: 

Test  - A:  Fortran  version  compiled  in  IBM  Fortran  - H. 

Test  - B:  Simulated  Cray  -1  program,  timing  off. 

Test  - C:  Simulated  Cray  -1  program,  timing  on. 

Test  - D:  Simulated  Cray  -1  program,  CPACT  report  generated. 

The  table  column  headings  are  left  to  right, 

- The  host  epu  time  in  milliseconds. 

- The  host  epu  time  normalized  to  Test  -B  (fastest  simulation 
mode)  . 

- The  host  epu  time  normalized  to  Test  -A  (Fortran  version) . 

- The  simulation  instruction  rate  in  instruction  issues/second. 

- The  host  cost  in  tenths  of  a cent. 

- The  host  cost  normalized  to  Test  -B. 

- The  host  cost  normalized  to  Test  -A. 

- The  simulation  instruction  cost  in  tenths  of  a cent  per  1000 
instructions . 

- The  ratio  of  the  host  epu  time  to  the  predicted  Cray  - 1 epu 
time.  The  Cray  -1  would  run  the  problem  approximately  13.4 
times  faster  than  the  Amdahl  Fortran  version. 

From  the  table  it  is  seen  that  both  the  host  epu  time  and 
cost  increases  by  a factor  of  10  with  timing  envied  and  increases 
by  a factor  of  30  when  the  resource  activity  trace  (CPACT)  is  enabled. 
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Table-1  Cray-1  Simulator  Cost  Comparison 

For  an  LU  Decomposition  Of  a 10x10  Matrix 


The  ratio  of  simulation  speed  (on  the  Amdahl  470/V’6)  to  Cray-l 
speed  is  roughly: 

3,100  to  1 with  no  timing, 

30,500  to  1 with  timing  and, 

92,200  to  1 with  the  activity  trace. 

With  simulation  on  the  Cray  -1  these  ratios  would  probably  drop  by 
a factor  of  three  to  five. 


lii  addition  to  the  SET  comiaand,  which  may  turn  timing  on  or 
ofr,  two  additional  instructions  ERT  and  DRT  have  been  defined  which 
also  affect  the  simulator  timing.  ERT  (002600)  enables  the  CKAY-1 
resource  timing  and  is  equivalent  to  the  command  SET  TIMING=ON. 

DRT  (002700)  disables  CRAY-1  resource  timing  and  is  equivalent  to 
SET  TIMING=OFF.  To  speed  simulation,  these  instructions  may  be 
placed  around  code  where  timing  is  not  desired. 
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Appendix  A. 

Summary  of  Cray-1  Timing  Information 

This  material  has  been  borrowed  from  the  Cray-1  Reference 
Manual,  publication  number  2240004,  by  Cray  Research,  Inc. 


When  issue  conditions  are  satisfied  an  instruction  completes  in  a fixed 
amount  of  time.  Instruction  issue  may  cause  reservations  to  be  placed 
on  a functional  unit  or  registers.  Knowledge  of  the  issue  conditions, 
instruction  execution  times  and  reservations  permit  accurate  timing  of 
code  sequences.  Memory  bank  conflicts  due  to  I/O  activity  are  the  only 
element  of  unpredictability. 

SCALAR  INSTRUCTIONS 

Four  conditions  must  be  satisfied  for  issue  of  a scalar  instruction: 

1.  The  functional  unit  must  be  free.  No  conflicts  can  arise  with  other 
scalar  instructions,  however  vector  floating  point  instructions 
reserve  the  floating  point  units.  Memory  references  may  be  delayed 
due  to  conflicts. 

2.  The  result  register  must  be  free. 

3.  The  operand  register  must  be  free. 

4.  Issue  is  delayed  1 clock  period  if  a result  register  group  input  path 
conflict  would  exist  with  a previously  issued  instruction.  One  input 
path  exists  for  each  of  the  four  register  groups  (A,  B,  S and  T). 

Scalar  instructions  place  reservations  only  on  result  registers.  A result 
register  is  reserved  for  the  execution  time  of  the  instruction.  No 
reservations  are  placed  on  the  functional  unit  or  operand  registers. 

A transmit  scalar  mask  instruction  to  Si  (073)  instruction  is  delayed 
by  (VL)  + 6 clock  periods  from  the  issue  of  a previous  vector  mask 
(175)  instruction,  and  is  delayed  by  6 clock  periods  from  the  issue 
of  a preceding  transmit  (Sj)  to  VM  (003)  instruction. 
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Execution  times  in  clock  periods  are  given  below.  An  asterisk  indicates 
that  issue  may  be  delayed  because  of  a functional  unit  reservation  by  a 
vector  instruction.  Memory  may  be  considered  a functional  unit  for  timing 
considerations , 

(A=A  register,  M=Memory,  B=B  register,  S=S  register,  I=Immediate,  C=Channel) 


24-bit  results: 


A^M 

11* 

A-*-C 

4 

M-^A 

1* 

A-^A+A 

2 

A-*-B 

1 

A-*— AxA 

6 

B-*-A 

1 

A-*— pop(S) 

4 

A-^S 

1 

A Izc(S) 

3 

A-^I 

1 

VL-^A 

1 

64-bit  results: 

S-^-M 

11* 

s-*— S+S 

3 

M-*-S 

1* 

S-^S(f.add)S 

6* 

S-^T 

1 

S -•-S(f.mult)S 

7* 

T 

1 

S -*—5(^.3.) 

14* 

1 

S-^V 

5 

S-^S(log.)S 

1 

v-*-s 

3 

S S(shift)I 

2 

S-^VM 

1 

S -^S(shift)A 

3 

S-*-RTC 

1 

S -^S(mask)I 

1 

S-*-A 

2 

RTC  -*-S 

1 

VM-^S 

3 

* Issue  may  be  delayed  because  of  a functional  unit  reservation  by  a 
vector  instruction.  Memory  may  be  considered  a functional  unit  for 
timing  considerations, 

VECTOR  INSTRUCTIONS 

Four  conditions  must  be  satisfied  for  issue  of  a vector  instruction: 

1.  The  functional  unit  must  be  free.  (Conflicts  may  occur  with  vector 
operations, ) 

2.  The  result  register  must  be  free.  (Conflicts  may  occur  with  vector 
operations. ) 

3.  The  operand  registers  must  be  free  or  at  chain  slot  time. 

4.  Memory  must  be  quiet  if  the  instruction  referemces  memory. 

Vector  Instructions  place  reservations  on  functional  units  and  registers 

for  the  duration  of  execution. 

1.  Functional  units  are  reserved  for  VL+4  clock  periods.  Memory  is 
reserved  for  VL+5  clock  periods  on  a write  operation,  VL+4  clock 
periods  on  a read  operation. 
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2.  The  result  register  is  reserved  for  the  functional  unit  time 
+(VL+2)  clock  periods.  The  result  register  is  reserved  for  the 
functional  unit  +7  clock  periods  if  the  vector  length  is  less  than 
5.  At  functional  unit  time  +2  (chain  slot  time)  a subsequent 
instruction,  which  has  met  all  other  issue  conditions,  may  issue.  This 
process  is  called  "chaining."  Several  instructions  using  different 
functional  units  may  be  chained  in  this  manner  to  attain  a significant 
enhancement  of  processing  speed. 

3.  Vector  operand  registers  are  reserved  for  VL  clock  periods.  Vector 
operand  registers  are  reserved  for  5 clock  periods  if  the  vector 
length  is  less  than  5.  The  vector  register  used  in  a block  store  to 
memory  (177  instruction)  is  reserved  for  VL  clock  periods.  Scalar 
operand  registers  are  not  reserved. 

Vector  instructions  produce  one  result  per  clock  period.  The  functional 
unit  times  are  given  below.  The  vector  read  and  write  instructions 
(176,  177)  produce  results  more  slowly  if  bank  conflicts  arise  due  to 
the  increment  value  (Ak)  being  a multiple  of  8.  Chaining  cannot  occur 
for  the  vector  read  or  write  operation  in  this  case. 

If  (Ak)  is  an  odd  multiple  of  8,  results  are  produced  every  2 clock 
periods. 

If  (Ak)  is  an  even  multiple  of  8,  results  are  produced  every  4 clock 
periods. 

Memory  must  be  quiet  before  issue  of  the  B and  T register  block  copy 
instructions  (034-037).  Subsequent  instructions  may  not  issue  for  144- (Ai) 
clock  periods  if  (Ai)/0  and  5 clock  periods  if  (Ai)=0  when  reading 
data  to  the  B and  T registers  (034,036).  They  may  not  issue  for  6+(Ai) 
clock  periods  when  storing  data  (035,037). 

The  B and  T register  block  read  (034,036)  instructions  require  that  there 
be  no  register  reservation  on  the  A and  S registers,  respectively,  before 
issue. 
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Branch  instructions  cannot  issue  until  an  AO  or  SO  operand  register  has 
been  free  for  one  clock  period.  Fall -through  in  buffer  requires  two 
clock  periods.  Branch-in-buffer  requires  five  clock  periods.  When  an 
"out  of  buffer"  condition  occurs  the  execution  time  for  a branch 
instruction  is  14  clock  periods. 

A two  parcel  instruction  takes  two  clock  periods  to  issue. 

Instruction  issue  is  delayed  2 clock  periods  when  the  next  instruction 
parcel  is  in  a different  instruction  parcel  buffer.  Instruction  issue  is 
delayed  14  clock  periods  if  the  next  instruction  parcel  is  not  in  an 
instruction  parcel  buffer. 

HOLD  MEMORY 

A delay  of  1,  2,  or  3 CP  will  be  added  to  a scalar  memory  read  if  a bank 
conflict  occurs  with  rank  C,  B,  or  A,  respectively,  of  the  memory  access 
network.  A conflict  occurs  if  the  address  is  in  the  same  bank  as  the 
address  in  rank  C,  B,  or  A.  Conflicts  can  occur  only  with  scalar  or  I/O 
references.  The  scalar  instruction  senses  the  conflict  condition  at 
issue  time  + 1 CP.  The  scalar  instruction  address  enters  rank  A of  the 
memory  access  network  at  issue  time  + 1 CP.  The  scalar  instruction 
address  enters  rank  B at  issue  + 2 CP.  The  scalar  instruction  address 
enters  rank  C at  issue  + 3 CP. 

Scalar  instruction  timing  (no  conflict): 


CP 

n 

Issue,  reserve  register 

CP 

n+1 

Address  rank  A,  sense  conflict 

CP 

n+2 

Address  rank  B 

CP 

0 

n+3 

Address  rank  C 

cl 

n+9 

Clear  register  reservation 

CP 

n+10 

Issue 
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HOLD  ISSUE 


A delay  of  issue  results  if  a 100  - 137  instruction  is  in  the  CIP  register 
and  a hold  memory  condition  exists.  The  delay  will  depend  on  the  hold 
memory  delay. 

A delay  of  issue  results  if  a 100  - 137  instruction  is  in  the  CIP  register 
and  a 100  - 137  instruction  in  process  senses  a conflict  with  rank  A,  B, 
or  C . 

An  additional  1 CP  delay  is  added  to  a hold  memory  condition  if  a 070 
instruction  conflict  is  sensed. 
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Appendix  B. 

Cray-1  Simulator  I/O  Device  Usage 


I/O  in  the  Cray-1  simulator  is  done  in  two  ways: 

1)  Through  the  use  of  standard  Fortran  data  set 
reference  numbers  (DSRN)  and, 

2)  Through  the  use  of  an  MTS  environment  file  or  device 
usage  block  (FDUB) . 

The  following  I/O  is  done  through  DSRNs: 

- All  error  messages  use  I/O  unit  0. 

- All  CPACT  output  uses  I/O  unit  1. 

- All  LOAD  module  input  uses  I/O  unit  2. 

- All  normal  Terminal  output  (echoing,  etc.)  uses  I/O 
unit  3. 

- All  memory  to  memory  I/O  used  for  number  conversion,  etc., 
uses  I/O  unit  20. 

The  following  I/O  is  done  through  MTS  provided  FDUBs: 

- All  command  input,  whether  from  a call-file,  an  AT-file, 

a USE- file  or  the  terminal  is  read  using  FDUBs.  The  command 
input  stack  is  implemented  with  FDUBs. 

- All  HELP  file  responses  are  read  from  a file  using  a FDUB. 

- The  simulator  driver  tables  are  loaded  at  start  up  time 
using  a FDUB. 

The  user  should  not  use  DSRNs  0,  1,  2,  3 and  20. 
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Appendix  C. 

Cray-1  Simulator  Common  Block  Usage 


The  Cray-1  simulator  currently  uses  23  Fortran  named  common 
blocks.  Except  for  /MEMORY/  and  /MSIZE/  the  user  should  not  de- 
fine symbols  (subroutines  or  named  common  blocks)  that  conflict 
with  common  block  names  used  by  the  simulator.  These  common 
block  names  are  listed  below: 


ACTFLG 

BRKCOM 

COM$F 

CONTRL 

CTABLE 

DECTBL 

DEV 

DRVTBL 
INSTRX 
MS FLAG 
QCODES 
LOAD 


QCOM 

REG 

REPORT 

SETABL 

STATE 

SYMTBl 

SYMTB2 

TRKBLK 

USAGE 

XCHANG 

$ 
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Appendix  D. 

Establishing  the  Simulator  on  MTS 

In  addition  to  the  object  module  which  contains  the  Cray-1 
simulator,  three  additional  files  and  one  initialization  program 
are  part  of  the  simulator. 

The  initialization  program  (TABINIT)  process  the  instruction 
driver  table  used  by  the  simulator.  TABINIT  converts  the  driver 
table  from  a character  format  to  an  internal  binary  format  which 
may  be  quickly  read  by  the  simulator  when  it  starts  up.  This 
program  is  only  needed  if  one  changes  the  driver  table. 

The  three  additional  files  are: 

1)  OPFILE  : The  character  format  driver  table  used 

as  input  to  TABINIT.  (Not  directly 
necessary  to  use  the  simulator.) 

2)  TABLES.DAT:  The  binary  file  which  is  output  by 

TABINIT.  This  file  needed  to  run 
the  simulator. 

3)  HELP  : This  file  contains  the  help  responses. 

It  is  not  essential  to  use  the  simulator. 

If  TABLES.DAT  and  HELP  are  available  under  the  CCID  that  is 
running  the  simulator,  they  will  be  read  as  they  are  needed. 

Alternatively,  one  can  recompile  the  subroutine  OPFDUB  (open 
fdub) , after  modifying  the  CCID  defined  in  a DATA  statement.  This 
CCID  should  point  to  an  alternate  MTS  catalog  where  TABLES.DAT  and 
HELP  can  be  found. 
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Appendix  E. 

Cray-1  Instruction  Summary 

This  summary  has  been  borrowed  from  the  Cray-1  Reference  < 

Manual,  publication  number  2240004,  by  Cray  Research,  Inc. 


CRAY  1 

CAL 

UNIT 

DESCRIPTION 

000000 

ERR 

. 

Error  exit 

♦OOOi  jk 

ERR  exp 

- 

Error  exit 

OOIOjk 

CA.Aj 

Ak 

Set  the  channel  (Aj)  current  address  to 
(Ak)  and  begin  the  I/O  sequence 

001  Ijk 

CL.Aj 

Ak 

Set  the  channel  (AJ)  limit  address  to  (Ak) 

0012j0 

Cl  .Aj 

Clear  channel  (Aj)  interrupt  flag 

0013]0 

XA 

Aj 

Enter  XA  register  with  (Aj) 

OOMjO 

RT 

Sj 

Enter  real-time  clock  register  with  (Sj) 

00200k 

VL 

Ak 

Transmit  (Ak)  to  VL  register 

♦002000 

VL 

1 

Transmit  1 to  VL  register 

002100 

EFI 

Enable  interrupt  on  floating  point  error 

002200 

DFI 

Disable  interrupt  on  floating  point  error 

0030j0 

VM 

Sj 

Transmit  (Sj)  to  VM  register 

♦003000 

VM 

0 

Clear  VM  register 

004000 

EX 

Normal  exit 

t004  i jk 

EX 

exp 

Normal  exit 

OOSOjk 

J 

Bjk 

Jump  to  (Bjk) 

000 1 3 km 

J 

exp 

Jump  to  exp 

007 1 j km 

R 

exp 

Return  jump  to  exp,  set  BOO  to  P 

0 I Oi 3 km 

JAZ 

exp 

Branch  to  exp  if  (AO)  ■ 0 

0 1 1 1 j km 

JAN 

exp 

Branch  to  exp  if  (AO)  / 0 

Ollijkm 

JAP 

exp 

Branch  to  exp  if  (AO)  positive 

0 1 3 i j km 

JAM 

exp 

Branch  to  exp  if  (AO)  negative 

014i jkm 

JSZ 

exp 

Branch  to  exp  if  (SO)  ■ 0 

01 5 i i km 

JSN 

exp 

Branch  to  exp  if  (SO)  ^ 0 

016 i j km 

JSP 

exp 

Branch  to  exp  if  (SO)  positive 

0 1 7 i ] km 

JSM 

exp 

Branch  to  exp  if  (SO)  negative 

020 i j km 

Transmit  jkm  to  Ai 

021i jkm ' 

A1 

exp 

Transmit  I's  complement  of  jkm  to  Ai 

022i)k  ' 

Transmit  jk  to  Ai 

0231  jO 

Ai 

Sj 

Transmit  (Sj)  to  Ai 

024ijk 

Ai 

Bjk 

Transmit  (Bjk)  to  Ai 

02  5 1 jk 

Bjk 

Ai 

Transmit  (Ai)  to  Bjk 

026i  jO 

Al 

PSj 

Pop/LZ 

Population  count  of  (Sj)  to  Ai 

027i jO 

Ai 

ZSj 

Pop/LZ 

Leading  zero  count  of  (Sj)  to  Ai 

030i jk 

Ai 

Aj  ‘Ak 

A Int 

Add 

Integer  sum  of  (Aj)  and  (Ak)  to  Ai 

t03010k 

Ai 

Ak 

A Int 

Add 

Transmit  (Ak)  to  Ai 

t030i jO 

Al 

Aj^l 

A Int 

Add 

Integer  sum  of  (Aj)  and  1 to  Ai 

031ijk 

Ai 

AJ  -Ak 

A Int 

Add 

Integer  difference  of  (Aj)  less  (Ak)  to  Ai 

♦031100 

Ai 

1 

A Int 

Add 

Transmit  -I  to  Ai 

031i0k 

Ai 

-Ak 

A Int 

Add 

Transmit  the  negative  of  (Ak)  to  Ai 

031ij0 

Ai 

Aj-1 

A Int 

Add 

Integer  difference  of  (Aj)  lets  1 to  Al 

0321jk 

Ai 

AJ»Ak 

A Int 

Mult 

Integer  product  of  (Aj)  and  (Ak)  to  Ai 

m 

1 


1 

J^ 
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CRAY- J 

CAL 

UNIT 

055i00 

Ai 

Cl 

0nj)0 

Al 

CA.Aj 

• 

OlBijl 

Al 

CE.Aj 

0T4i  }ii 

B)k  , Al 

,A0 

Memory 

' 0Ui;k 

Bj  k ,A 1 

O.AO 

Memory 

05Si jk 

.AO 

B j k . A i 

Memory 

^O^Sijk 

0,A0 

Bj  k .Ai 

Memory 

0161  jt 

Tj  k ,Ai 

.AO 

Memory 

’ Olb  i ) k 

Tj  k , Al 

O.AO 

Memory 

057i3k 

.AO 

Tjk.Ai 

Memory 

♦037iik 

O.AO 

Tjk.Ai 

Memory 

040  1 j km  1 
04 1 1 ) km  ( 

S 1 

exp 

0421  jk 

Si 

<exp 

S Logical 

Si 

* >exp 

* 0 4 2 J 7 ■' 

Si 

1 

S Logical 

0451 Jk 

Si 

<exp 

S Logical 

Si 

> >exp 

♦045i00 

Si 

0 

S Logical 

044  1 J k 

Si 

S)6Sk 

S l.ogical 

♦044ij0 

Si 

S)«SB 

S Logical 

♦044i jO 

Si 

SBSSj 

S Logical 

^04  5 uk 

Si 

»Sk«Sj 

S Logical 

^04  5 1 jO 

Si 

•SB6S) 

S Logical 

046  1 J k 

Si 

Sj\Sk 

S Logii.al 

♦046  1 J 0 

Si 

Sl\SB 

S 1 og i<  a 1 

• 1-46  1 ; 0 

s l 

SBtSj 

S Logiial 

047  1 J k 

Si 

•SjSSk 

S Logical 

♦04  TiOk 

Si 

fSk 

S Logical 

r047i  jO 

Si 

*Sj\SB 

S Logical 

tn4‘»i  jo 

Si 

• SBSSj 

S Logical 

♦047i00 

Si 

• SB 

S Logical 

USOl  Jk 

Si 

Sj iSjSSk 

S Logical 

+ 0501  jO 

Si 

Sj :Si5SB 

S Logical 

05  1 1 J k 

Si 

Sj  ;Sk 

S Logical 

t05l lOk 

Si 

Sk 

S Logical 

♦OSjgjO 

Si 

Sj  :SB 

S Logical 

+ 051  I jO 

Si 

SB.'SJ 

S Logical 

tOS]  lOO 

S 1 

SB 

S Logical 

0 5 j 1 J k 

so 

Sixexp 

S Shi f t 

055i  Jk 

so 

Sixexp 

S Shift 

054  1 ik 

Si 

Si <e  xp 

S Shift 

05  J 1 ) k 

Si 

Si >exp 

S Shift 

0561  jk 

Si 

Si  .SjeAk 

S Shift 

*056,1  jO 

Si 

Si .Sj<l 

S Shift 

*056i0k 

Si 

SlcAk 

S Shift  . 

DESCRIPTION 

Channel  nuaiber  to  Ai  fj’O) 

Address  of  channel  (Ai)  to  Ai  (j^O,  k-0) 

Error  flag  of  channel  (A))  to  Ai  (jfO,  k-1) 

Read  (Ai)  words  at  B register  jk  from  (AO) 

Read  (Ai)  words  at  B register  jk  from  (AO) 

Store  (Ai)  words  at  B register  jk  to  (AO) 

Store  (Ai)  words  at  B register  jk  to  (AO) 

Read  (Ai)  words  at  T register  )k  from  (AO) 

Read  (Ai)  words  at  T register  jk  from  (AO) 

Store  (Ai)  words  at  T register  jk  to  (AO) 

Store  (Ai)  words  at  T register  jk  to  (AO) 

Transmit  jkm  to  Si 

Transmit  I's  complement  of  ;km  to  Si 
Form  I's  mask  in  Si  from  the  right 

Enter  1 it. to  Si 

Form  I's  mask  in  Si  from  the  left 
Clear  Si 

Logical  product  of  (Sj)  and  (Sk)  to  Si 
Sign  bit  of  (Sj)  to  Si 
Sign  bit  of  (Sj)  to  Si  (jpO) 

Logical  product  of  (Sj)  and  I's 
complement  of  (Sk)  to  Si 

(Sj)  with  sign  bit  cleared  to  Si 
Logical  difference  of  (Sj)  and  (Sk)  to  Si 
Toggle  sign  bit  of  S),  then  enter  into  Si 
Toggle  sign  bit  of  Sj.  then  enter  into  Si  fj/0) 
Logical  equivalence  of  (Sk)  and  (Sj)  to  Si 
Transmit  I's  complement  of  (Sk)  to  Si 

Logical  equivalence  of  (S))  and  sign 
bit  to  Si 

Logical  eqt  ivalence  of  (Sj)  and  sign 
bit  to  Si  (jpO) 

Enter  I's  complement  of  sign  bit  into  Si 

Logical  product  of  (Si)  and  (Sk)  complement 
ORed  with  logical  product  of  (Sj  and  (Sk)  to  Si 

Scalar  merge  of  (Sj)  and  sign  bit  of  (Sj) 
to  Si 

Logical  sum  of  (Sj)  and  (Sk)  to  ‘ji 
Transmit  (Sk)  to  Si 

Logical  sum  of  (Sj)  and  sign  bit  to  Si 

Logical  sum  of  (Sj)  and  sign  bit  to  Si  (jFO) 

Enter  sign  bit  into  Si 

Shift  (Si)  left  exp  places  to  SO 

Shift  (Si)  right  exp  places  to  SO 

Shift  (Si)  left  exp  places 

Shift  (Si)  right  exp  places 

Shift  (Si  and  Sj ) left  (Ak)  places  to  Si 

Shift  (Si  and  S) ) left  one  place  to  Si 

Shift  (Si)  left  (Ak)  places  to  Si 
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CRAY- 1 

CAL 

UNIT 

— r— — • — ••  r • 

DESCRIPTION 

0571jK 

Si 

Sj ,Si >Ak 

S Shift 

Shift  (Sj  and  Si)  right  (Ak)  places  to  Si 

tOS7i  jO 

Si 

Sj ,Si>l 

S Shift 

Shift  (Sj  and  Si)  right  one  place  to  Si 

♦057i0k 

Si 

Si  >Ak 

S Shift 

Shift  (Si)  right  (Ak)  places  to  Si 

060i  jk 

Si 

Sj»Sk 

S Int  Add 

Integer  sum  of  (Sj)  and  (Sk)  to  Si 

r 061ijk 

Si 

Sj  Sk 

S Int  Add 

Integer  difference  of  (Sj)  and  (Sk)  to  Si 

+06M0k 

Si 

Sk 

S Int  Add 

Transmit  negative  of  (Sk)  to  Si 

- 062ijk 

Si 

Sj«FSk 

F.P.  Add 

Floating  sum  of  (Sj)  and  (Sk)  to  Si 

t0t)2  iOk 

Si 

•FSk 

F.P.  Add 

Normalize  (Sk)  to  Si 

063i  jk 

S) 

Sj -FSk 

F.P.  Add 

Floating  difference  of  (Sj)  and  (Sk)  to  Si 

t065i0k 

Si 

-FSk 

F.P.  Add 

Transmit  normalized  negative  of  (Sk)  to  Si 

0f)4  l j k 

Si 

Sj'FSk 

F.P.  Mult 

Floating  product  of  (Sj)  and  (Sk)  to  Si 

06Si]k 

Si 

Sj*HSk 

F.P.  Mult 

Half  precision  rounded  floating  product 
of  (Sj)  and  (Sk)  to  Si 

Obb  1 j k 

Si 

Sj*RSk 

F.P.  Mult 

Full  precision  rounded  floating  product 
of  (Sj ) and  (Sk)  to  Si 

(Jb7  1 ) k 

Si 

Sj*ISk 

F.P.  Mult 

2 - Floating  product  of  (Sj)  and  (Sk)  to  Si 

070i jO 

Si 

/HSj 

F.P.  Rcpl 

Floating  reciprocal  approximation  of 
(Sj)  to  Si 

071 iOk 

Si 

Ak 

- 

Transmit  (Ak)  to  Si  with  no  sign  extension 

071ilk 

Si 

♦Ak 

Transmit  (Ak)  to  Si  with  sign  extension 

07n2k 

Si 

♦FAk 

- 

Transmit  (Ak)  to  Si  as  unnormalized 
floating  point  number 

071150 

Si 

0,6 

- 

Transmit  constant  0.7S*2**48  to  Si 

071140 

Si 

0 . 4 

- 

Transmit  constant  0.5  to  Si 

071iS0 

Si 

1 . 

- 

transmit  constant  i.u  to  si 

071 ibO 

Si 

2. 

- 

Transmit  constant  2.0  to  Si 

071 1 70 

Si 

4 . 

- 

Transmit  constant  4.0  to  Si 

072i00 

Si 

RT 

- 

Transmit  (RTC)  to  Si 

O7MO0 

Si 

VM 

- 

Transmit  f VM)  to  Si 

074ijk 

Si 

Tjk 

- 

Tr ansm it  (T  j k ) to  Si 

0 Tm  j ^ 

Sj 

- 

Transmit  (Si ) to  Tjk 

0 7 b 1 j k 

Si 

Vj  .Ak 

- 

Transmit  (Vj , element  (Ak))  to  Si 

077ijk 

Vi  .Ak 

Sj 

- 

Transmit  (Sj)  to  Vi  element  (Ak) 

t077iOk 

Vi  ,Ak 

0 

Clear  Vi  element  (Ak) 

1 Oh  1 j km 

A i 

exp , Ah 

Memory 

Read  from  ((Ah)  ♦ exp)  to  Ai  (A0*0) 

tlOOi]km 

Ai 

exp  ,0 

Memory 

Read  from  (exp)  to  Ai 

tlOOi jkm 

Ai 

exp , 

Memory 

Read  from  (exp)  to  Ai 

♦lOhiOOO 

Ai 

,Ah 

Memory 

Read  from  (Ah)  to  Ai 

1 1 h i j km 

eifp , Ah 

AI 

Memory 

Store  (Ai)  to  (Ah)  ♦ exp  (AO-0) 

* 1 1 0 1 J km 

exp  ,0 

Ai 

Memory 

Store  (Ai)  to  exp 

M ]0i jkm 

exp, 

Ai 

Memory 

Store  (Ai)  to  exp 

tl  IhiOOO 

,Ah 

Ai 

Memory 

Store  (Ai)  to  (Ah) 

1 2h 1 j km 

Si 

exp , Ah 

Memory 

Read  from  ((Ah)  ♦ exp)  to  Si  (AO-O) 

tl20i jkm 

Si 

exp.O 

Memory 

Read  from  exp  to  Si 

1 1 20 i j km 

Si 

exp, 

Memory 

Read  from  exp  to  Si 

tWhiOOO 

Si 

.Ah 

Memory 

Read  from  (Ah)  to  Si 

1 5h i j km 

exp, Ah 

Si 

Memory 

Store  (Si)  to  (Ah)  ♦ exp  (AO-0) 

* tl 50i jkm 

exp,0 

Si 

Memory 

Store  (Si)  to  exp 

1 1 50 i j km 

exp. 

Si 

Memory 

Store  (Si)  to  exp 

*l  5hiOOO 

,Ah 

Si 

Memory 

Store  (Si)  to  (Ah) 

‘ . MOijk 

Vi 

SjSVk 

V Logical 

Logical  products  of  (Sj)  and  (Vk)  to  Vi 

tMOiOO 

Vi 

0 

V Logical 

CU*ar  Vi 

I41i]k 

Vi 

Vj»Vk 

V Logical 

Logical  products  of  (Vj)  and  (Vk,  to  Vl 

• 

1 
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CRAY  1 

CAL 

UNIT 

DtSCRIPTION 

142  1 

Vi 

S)  : Vk 

V Logical 

Logical  bUffls  of  fS)j  and  (Vkj  to  Vl 

M42  lOk 

Vi 

Vk 

V Logical 

Transmit  (Vkj  to  Vi 

14Mjk 

Vi 

vj : Vk 

V Logical 

Logical  sums  of  TVj)  and  fVk)  to  Vl 

144  1 j k 

Vi 

‘••iWk 

V Logical 

Logical  differences  of  ff.j)  and  fVk)  to  Vi 

I45»)k 

Vi 

Vj\vk 

V Logical 

Logical  differences  of  fVj)  and  fVk)  to  Vi 

1 4f)  1 ) k 

Vi 

Sj  : Vkf,VM 

V Logical 

Transmit  fSj)  if  VM  Lit  ■ 1,  (VL)  if 

VM  b i t • 0 l o Vi 

t M M 0 k 

Vi 

• VM^.Vk 

V Logical 

Vector  merge  of  (Vk)  and  0 to  Vi 

1 47  1 jl. 

Vi 

vj : vkf,vM 

V Logical 

Transmit  fVj j if  VM  bit  ■ 1.  fVkj  if 

VMbit  • 0 toVi 

ISOijk 

Vi 

Vj  'Ak 

V Shift 

Shift  (Vj)  left  fAk)  placr-s  to  Vi 

M ^ 0 1 j 0 

Vi 

V)'l 

V Shift 

Shift  fVj)  left  one  place  to  Vi 

1 M 1 j ^ 

Vi 

Vi  >Ak 

V Shift 

Shift  (Vj)  right  (Ak)  places  to  Vi 

M 1 1 ) 0 

Vi 

Vj>l 

V Shift 

Shift  (Vj)  right  one  place  to  Vi 

r. : i j k 

Vl 

Vj .Vj  <Ak 

V Shift 

Double  shift  fVj)  left  (Ak)  places  to  Vi 

M -i  2 1 j 0 

Vi 

< 

< 

V Shift 

Double  shift  (Vj)  left  one  place  to  Vi 

1 S ^ 1 J k 

Vi 

< 

> 

> 

V Shift 

Double  shift  fVj j right  (Ak)  places  to  Vl 

I 1 u 0 

Vl 

V) ,Vj  > 1 

V Shift 

Double  shift  (Vj)  right  one  place  to  Vi 

I '.4  1 jk 

Vl 

',j  ‘Vk 

V lot  Add 

Integer  sums  of  (Sj)  and  (Vk)  to  Vi 

I '<  j 1 j k 

Vl 

Vj'Vk 

V lot  Add 

Integer  sums  of  (Vj)  and  (Vk)  to  Vi 

1 0 (•  1 j k 

Vl 

S)  Vk 

V lot  Add 

Integer  differences  of  fS)j  and  (Vk)  to  Vi 

‘ I 'if- 1 Ok 

Vl 

■Vk 

V lot  Add 

Transmit  negative  of  (Vk)  to  Vi 

r.  7 , . ► 

Vl 

Vj  Vk 

V lot  Add 

Integer  differences  of  (Vj)  anil  (Vk)  to  Vl 

1 f>  0 j ) k 

Vl 

Sj'IVk 

F.P.  Mult 

floating  products  of  (Sj)  and  (Vk)  to  Vi 

1 M i j k 

Vl 

Vj  *kVk 

F.P.  Mult 

floating  products  of  (V))  and  (Vk)  to  Vi 

1 6 2 1 j k 

Vl 

S)*HVk 

F.P.  Mult 

Malf  precision  rounded  floating  products 
of  (Sj  j and  (Vk ) to  Vi 

1 f-  1 i f k 

Vl 

Vj'IIVk 

F.P.  Mult 

Malf  precision  rounded  floating  products 
of  (V) ) and  (Vk ) to  Vi 

1 f>  4 I ) k 

Vl 

Sj*RVk 

F.P.  Mull 

Rounded  floating  products  of  (Sj)  and 
(Vk  ) t*.  V 1 

1 h “i  1 j k 

Vl 

Vj'HVk 

F.P.  Mult 

Roundeii  t mat  mg  prouucis  ui  i«ji 
(Vk)  to  Vl 

1 f»fi  1 J k 

Vl 

Sj*lVk 

F.P.  Mult 

2 f 1 oa t 1 ng  p r A f u<  t s of  f S j ) .ind 

(Vk)  to  Vl 

I'.-l  Ik 

Vl 

Vj*IVk 

IP.  Mult 

2 float  It  g produils  of  (Vj)  and 

(Vk ) to  Vl 

1 70  1 j k 

V 1 

'.j»l  Vk 

IP.  Add 

floating  ■ um . ol  (Sj)  and  (Vk ) to  Vi 

‘ 1 70 (Ok 

Vl 

•IVk 

1 P.  Add 

Normal i /e  ( Vk  » to  Vi 

1 ^ I U k 

Vl 

Vj ♦! Vk 

IP.  Add 

Floating  sums  of  fVji  and  (Vkj  to  Vi 

1 ^ 2 1 1 k 

Vl 

Sj  IVk 

IP.  Add 

floating  differences  of  fSj)  and  (Vk)  to  Vi 

t 1 72 lOk 

Vl 

FVk 

F.P.  Add 

Transmit  normalized  negatives  of  (Vk)  to  Vl 

I7^i)k 

Vl 

Vj  FVk 

F.P.  Add 

Floating  differences  of  (Vj)  and  (Vk)  to  Vi 

I 7 4 1 I 0 

Vl 

/MVJ 

F.P.  Rcpl 

floating  reciprocal  appron imat ions  of 
fVj ) to  Vl 

I 7 S 0 j 0 

VM 

Vj.: 

V l.ogic.il 

VM» 1 where  (V) ) • 0 

17S0J  I 

VM 

V)  ,N 

V Logical 

VM- 1 where  (Vj  1 f (i 

|7''>Oj2 

VM 

Vj  ,P 

V Logical 

VM- 1 where  (Vj ) positive 

l7S0j^ 

VM 

Vj  ,M 

V Logical 

VM-1  where  (V) ) negative 

170iOk 

Vl 

,AO,Ak 

Memory 

Read  (VL)  words  to  Vl  from  (AO) 

Incremented  hyfAk) 

t 1701 00 

Vl 

.AO,  I 

Memory 

Read  (VL)  words  to  Vi  from  (AO) 
inc  remented  by  1 

|770jk 

.A0,Ak 

V) 

Memory 

Store  (VL)  words  from  Vj  to  (AO) 
incremented  by  (Ak) 

ti770j0 

,A0, 1 

Vj 

Memory 

Store  (VL)  words  from  Vj  to  (AO) 
incremented  by  1 

t Special  syntax  form 
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Appendix  F. 

Cray-l  Simulator  Error  Stops 

This  appendix  discusses  possible  (though  unlikely)  simulator 
’ error  stops.  These  error  stopr  reflect  internal  simulator  errors 

> that  could  adversely  affect  simulation  results  if  the  simulation 

was  allowed  to  proceed. 

I I 

! j The  user  program  that  calls  the  simulator  as  a subroutine 

i could  cause  an  error  stop  to  occur  by  over  writing  simulator 

tables  and  data  structures.  An  invalid  TABLES.DAT  file  could  also 
cause  an  error  stop. 

■ Some  error  stops  print  an  error  message  prior  to  halting, 

^ other  stops  only  indicate  a stop  code.  The  list  of  error  stops 

' below  are  separated  into  two  groups:  those  that  print  a message 

and  those  that  indicate  only  a stop  code.  The  simulator  subpro- 
gram issuing  the  error  stop  is  also  noted  below. 


I 

4 


Error  Stops  with  Messages 

Origin  of  Stop  Stop  Message 

TABTNI  RECORD  OVERRUN  IN  TABINI.  CAN'T  INITIALIZE. 

TAB INI  TABINI  UNABLE  TO  READ  DECODE  FILE. 

DECODE  INTERNAL  ERROR.  DECODE  TABLES  CLOBBERED. 

SIMBRK  SIMBRK  CALLED  BUT  BRKSET  i ZERO. 

SIMBRK  SIMBRK  CALLED  BUT  BREAKPOINT  NOT  IN  TABLE. 
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Error  Stops  With  Stop  Codes 


Origin  of 
Stop 

Stop 

Code 

MSW 

101 

MSW 

102 

SIMCON 

103 

SIMCON 

104 

OP  ROC 

105 

QWRTTE 

106 

QWRTTE 

107 

SETMSK 

108 

BLDMSK 

109 

DECODE 

113 

SIMCON 

114 

RESERV 

115 

ACTION 

116 

QWRITE 

118 

RESGOO 

200 

RESGOl 

201 

RESG02 

202 

RESGOl 

203 

RESG04 

204 

RESGOS 

205 

RESG06 

206 

RESG07 

207 

RESG14 

214 

RESG15 

215 

RESG16 

216 

RESGI7 

217 

TRACK 

300 

ENTRAP 

400 

ENTRAP 

401 

ENATTN 

402 

ENATTN 

403 

RESG07 

1071 

Comments 

Invalid  bit  code  in  MSW. 

Invalid  argument  to  MSW. 

Unimplemented  action  code  used. 

II  II  M II  II 

Invalid  queue  action  code. 

Queue  space  exhausted. 

Invalid  queue  pointer. 

Invalid  bit  code  in  SETMSK. 

Invalid  hold  issue  code. 

Bad  instruction  format  code. 

Invalid  action  code  used. 

Invalid  reservation  code. 

Action  held  and  Queue  empty 
Invalid  clock  period  argument. 

Invalid  G-field  dispatch  code. 

M II  II  ft 

It  ft  ft  t* 

It  ft  tt  II 

It  It  It  tt 

It  tt  tt  tl 

It  tt  tt  It 

tt  tl  tl  It 

tt  It  tt  tl 

It  It  tt  tt 

It  It  It  tl 

tt  tt  tt  tt 

Invalid  track  command  code. 

Floating  point  interrupt  process  failure. 

If  It  It  tt  M 

Attention  process  failure. 

II  It  It 

Invalid  J-field  dispatch  code. 
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Appendix  G. 


Name : 

Language : 

Storage : 

Operating 

System 

Requirements : 

Other 

Requirements : 
Availability: 


Program  Availability  Information 


Cray-1  Simulator 

IBM  360/370  Fortran-IV 

IBM  360/370  Assembly  language 

Major  revision  required  to  run  on  a non-IBM  360/370 
series  machine.  (32  bit  word,  byte  addressability) 

270,000  bytes  with  a 4,096  word  Cray-1  memory. 


The  simulator  runs  on  the  Michigan  Terminal  System  (MTS) . 
Changes  will  be  necessary  to  interface  with  other 
operating  systems  and  other  Fortran  I/O  systems. 


none 

Source  language  programs  available  from  Professor 
D.A.  Calahan  on  9-trac)c,  800  BPI  magnetic  tape. 
The  program  is  roughly  11,000  records  long. 

Professor  D.A.  Calahan 
University  of  Michigan 

Dept,  of  Electrical  and  Computer  Engineering 
Ann  Arbor,  MI  48109 
(313)  763-0036 
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IHIS  PACE  IS  BEST  QUALITY  FRAC II CABLE 
7DCM1  COPY  FUIKNISEEL  10  U1)C 

Appendix  H. 

Sample  Simulator  Exit  Dispatcher 


r 
c 

r 
C 
r 
r 
c 
r 
r 
r 
c 
r 
r 
r 
c 
c 
c 

C COIHON  BLOCF  •''■•'Opr. 

r 

Un'tPT. ’’  r>t>r'ri''IO'J 

/rSTYF/  '1F'*ST7, 

T.fTpTPf> 

CO"»»n>l  /"Tmnvy/ 

POnTVM.’'NC’'  (’'’=■1(1)  » 
r 
r 
r 
c 

r ...  DTSPHTrii  CN  THp  RYT-^  rnUR  (T.TK) 

C 

no  TO  (100 ,700)  , T.TK 
.'TPnr . 

RPTMOV 

r 

r ...  rnoE-1  TNTTTHLT7.r  "'HTniX  (A1->B((SF,  »(?*ST7,P) 

100  »r=:A*i>  (2*  1 ) 

(1*1) 

s''RnnR  - '«noo'- 

nn  v»  1 7 = 1 , N 

If-  .t 

no  110  T=i,*i 

PP1  (lAnnp  *1)  = K 

nnps»)Annpf  i 

K=K-1 

TF(r.LT.))  if  = n 
11.1  rnNTT”'ip 
lao  roHirNMi 

PPTnWH 


film  ( 1 ) 

T1RP  (2,  no^f') 
r''P«(i,i),  HiEi(i)) 


f^MPPOU'^TN  P r”  l\YPV  (r.lK,  k$P,  S$p,  VSR,  VI,  PXSW) 
TUPI  TCtt  TNTFnpP  ( A-'T) 
lOOTCAI,  EXSW,  NOA  NS/.  TRUE.  / 

TUTW^wR  ASF(B),  f>*»'C(7) 

»»RAi.*i  sSR(p),  vtp(f.4,fl),  ns 
ROi»TVALPNC’'(»TC(1)  ,nS) 


T'lTs  pxTT  FPorRSsnp  IS  nspD  ny  a puli,  patrix  lh  pactorization 
PPOORA*.  two  PYIT  witNCTICNS  AP”  rpCWinSD: 

FX  1 - INTTIALTZES  tup  SPUARf  raTRIX  is  CRAY-1  HRRORY. 

RPSTSTRR  A1  POINTS  TO  THK  1ATRTX. 

PROTSTRR  A?  CONTAINS  THP  1ATFIX  STZP. 

PY  ■>  - PRINTS  OtJT  THR  PUN  ’"11"  ANT  THF  1ATPTX  SIZR. 

PPOISTRR  A7  CONTAINS  TffP  1ATR’’X  SIZE. 

RROTSTRO  S7  CONTAINS  TfiP  praI  Til'  CLOCK  VALOR. 

optipnally,  IE  Tifi-  loo.tcal  variable  'ncans'  is  .PALSR., 

THEN  TUP  ”EX  2"  WILL  ALSO  drtnt  THE  1ATRIX  SOLUTION. 
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THIS  TAGS.  IS  BEST  QUALITY  PRACTICABJUS 

/ROM  COPY  f UmSHUl  10  uoc  ^ 


r 

r 

''  ...  nnw=2  nPTUT  TM’'*  "■T*"’.  {H  - 'lATPTT  ’^17?,  *^7  = PUM  TTflF) 

?0f)  m C^*!) 

WnrTP  1.)00)  ^*P(7♦1),  PTC(?) 

nv)  ) 'T7P  = *,r'i,*  p'pr=’,r7) 

r '•YSW  = .'^PP''. 

TPflOANP)  PPTIIPP 

r 

nn  ?S')  i,»i 

2*))  WPTTP  (fi,  1 100)  SP^OnP*  f.l- 1)  *N»I  ),0=1,*l) 

1107  ppP'^A'^flX,  10P7.1) 
r 

pP'T'ffPP 

C 

r 

FNH 


Appenaix  l. 

Sample  Simulator  Calling  Program 


C 

' r 
r . . 

r 

* C 

r 

r 

r 

r 

C 

r 

C 

r 

r 

r 

r 

c 

r 

r 

r 

c 

r 

. c 

r 

r 

r 

c 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

r 

a C . a 

r 


r 

C .. 


T-niirTT  TNTPnFP(ft-7)  JHH  PAffB  IS  BEST  QUALITY  FRACIICABLS 

ntOM  COPY  FUW^  I SHED  TO  DUG  • 

TUT'*  *1ATN  npnn'”\'1  C"LI,U  t n «;  C'»AY-1  «:Tf'tlIATCP  AS  A StJI  POU'^' I M E 
T'^  Sni.VR  rAPATLPf.  SYSTEMS  OF  TRI-CIArONAL  FQMATTONr..  UP  TO  64 
PA»>AI.LFI,  SyST’=’!1S  'lAV  P?  SOIV’^D. 

TUTS  npoOFA’"  '''’opoowq  tHF  ''OLLOW^  NO  FUNCTIONS: 

1.  OFAOS  >^40  IN^'tl'"  f>A»  A FF'^FPS: 

NSYS  - THE  MUIPFP  OF  PAPALLFL  EYSTFFS  10  SOLVE. 

VECS  - -^UF  NOm^P  OF  EOUA'^rCNS  IN  EACH  SYSTEN. 

?.  ALI.nCA’^F.S  -^MF  OPAV-l  f’^FORY  ’'CP  THE  THPFF  DIAOONALS  ANO  THE 
PTOHT  HANO  STOE. 

■».  YNTTTAir7''S  -^I'E  SYSTFFS. 

4.  I.OAnS  THE  OPAY-I  TPT-nTAOONAL  !,')  DFCONPnsiTION  ROUTINE  (TRIDEC) 

AVn  TNIFTAT.T7'=*S  '’'HF  CAII.INS  PAPAFFTERS  IN  AN  AODPESS  LIST 
IN'"  F riATFT.Y  PPPCF^OTNO  THE  lOAEFn  PROORAF. 

SIVPS  CONTROL  TO  Tpp  VIA  THF  STNULATOR  CONMAND  LANSHAGE, 

ALLOWING  thf  uffp  to  RtIN  TH ’=’  FFCGPAN,  S »’T  BREAK  POINTS,  BTC. 

f.  NOON  nPTljnN  prr)y  tr  p SIEHLAtoP,  FPINT  OUT  trp  SYSTEI  AND 
CALCULATE  t r p m flops. 

1.  LOAFS  TUE  PACF-SUnSTITUTIOM  CPAY-I  PHOORAN  (TRISLV)  AND  INITIALIZES 
ITS  CAI  T INS  r>np  Ar*'TPi5c.  ^ 

E.  AGAIN  GIVEG  CONTROL  TO  trp  tjSEP  7C  RUN  TUE  PPOGPAN. 

0.  UPON  P’^TfioN  ppo  N THP  STFULATOP,  IPINT  OUT  tHp  SYSTEH  AND 
CALCni.AT*'  trp  mrloPS. 


THIS  nPOOPAN  T|y«;r.c:  n-Rp  PLACE  OP  STNULATOP'S  NMN  PROGRAE  SINCE 

’’T  ’’S  rCAPFU  PTRST.  IT  ALSO  EKTFNDS  THE  SIMULATOR  MEMORY  TO  8192 
WORDS. 


COMMON  /PAP*"S/  NSYS,  N EOS  , AUASE,  SPASF,  CHASE,  YBASE 


TUP  FOLLOWING  IS  AN  PT'T'NSTON  0*  TUF  CPAY-1  SIMULATE’)  flEMOPY. 

DOUBL®  PPpCISTON 

COMMON  /memory/  *MMfS10?| 

COMMON  /MST7.T/  MPNOIF 
YNTEGEPMO  HMFM(1776E) 

INTEGER  ITMM  (?, 819P) 

EOUTVALENCF  (MEM(1),rMEM<  1,1)  ,nHEM(l)) 

TET.T  THE  STMnLATOP  THE  NEW  SIZE  OF  CPAT-1  MEMOPY. 

NMMCIT  = 8107 

CAIT.  CRAY  U'TNTT*  • , .TRUE.) 


Ill 


T n 


THIS  PAGE  IS  BEST  QUALITY  FRACTICABLI 
EMWI  COPY  i'UKNlSHEC  XO  LLC  __ 


r 


r 

c 

c . . . 

IT 

- 

I pIP'^ottON 

INOPR  M'NT. 

r 

THE 

DTSTANCR  ORTWEFN  DIAGONAL 

r . . . 

T.1 

- TffR 

T nTPr-OTTON 

INCPRMRNT. 

r 

Ttip 

DTSTANf'R  prTVRFN  PARALL'L 

r 

r 

IT 

= 1 

100))  'Jsrs,  M POS 
Don  DT>•>!^'r  (Tn /J^) 

. nT.  fi'i)  r,n  TO  O10 
C 

T.l  = MPOS 


ELEMENTS. 

ELEMPNTS. 


r 


r 

r ..  . 

C 

r 

r 

r ..  . 


10 

r 

r ..  . 


?n 

r 

^ 200 

r. 


SPT  AP^^Y  EASFS. 

CTATI?  = 

/(RASP  = rFRrip  ♦ MSYS^NPOS 
nRARu  = APA''*’  * SSYS*N’’00 
YRASP  = nPASP  ♦ VSYS*NP03 

TPfYPASP  ♦ *>•((  .r,T.  MP»*ST7)  30  TO  900 

TNT’»'taLtxp  TP'^nPr*  rr>AY  "'PMOPY  WTT'I  the  TPI-nTA30NAL  DATA. 


T.oon  <r*iRri  TVr  Ft.r'*''YTS  A SYSTEM  TO  T *J  TTT  A I,  I ZR . 
no  1 ) I = 1 , N Roq 


T.ron  ■’’Mpn  alt.  •’a'>at.l''i  sys'^rms. 

DO  10  .1  = 1,MSYS 


"•RM  (CDA (0- T)  *1.1  4-  (T-1)) 

MR*  (ADA  S'*  (.1-1)  *T,j  ♦ (T-1)) 

M'M  (PTIA  SR*  f.1- 1)  *I.T  * (T-1)) 
*RM  (YBA*-”*  (.1-1)  *T.I  * (T-1)) 
rovTTND  R 


J.  IRQ 
1 . JDO 
T/10. JDO 
1*1.900 


OLRAP  OUT  THP  TOP  op 
no  0 ) ,T  = 1,NSV'' 

MF*  (CSA  SR*  (.1-1)  *T.T 
M'*  (BSA  S'*  (.1-1)  *T.1 
rONTTNOR 


r AN  D THP  BOTTC  M OR  p. 

* (1-  1)  ) =0.  ODO 

♦ (N'OS-1))  = C.')P0 


N'T'’'?  (14,  12  )0) 

POPOAT (• irPAY -1  TPT-DTAnOK AL  SOLVER') 
OAT  I ofipr  K (0,  .'AI.SR.  ) 
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THIS  PA&E  IS  BEST  QUALir/  PRACTICABLE 

raOJH  COtY  FURHISHiD  TO  DOC  

r 

r ...  '^^'T  II"  Till:  CML  rincr  wtt‘I  FrTrT*'"^  to 

r T'l*'  ^ P".'!**  PVT"  . 

c 

r T rv-  i)j  r HTYS,  T,o<"  iji  = NFO?>,  TOC  10;  = TT,  I.OC  103  = TJ, 

r lo*^  134  = rinr*'.  irMr-MCj)  - cpay-1  flroFP'^s  tfro.) 

r 

!"'=•"(?,  1*^  1)  = R 

T'*i"'f'>,1)*1)  - Ri 

TMOM  ( 1 1 ♦ 1)  r r,A 

TIPiF?,  12*1)  = TRURE-  1 

( 2,  1 3 ♦ 1)  = ^ 1 H R''-  1 

’■"PM  (7,  14*1)  = CRASP 

IMP"  (1,  1*i*  1)  ^ r,r, 

T«pM(7^1f,t1)  = 

C 

r ...  RPT  ’I"  APCTIMpvT-  inrATTow.R. 

TMPM  (7,RR*  1)  = ) 

TMPM  r7,fi7  fl)  = TT 
TMPM(2,rR»1)  = TT 
TIP"  (7,  ftS  ♦ 1)  = *IP,0''> 

IMP"  (7, 64  ♦ 1)  'IPYP 
r 
C 

I"  ...  T/IAD  TP’Opr  ANfi  '1T7F  STinr.ATOP  rONTFCL  TO  T’»P  USER. 

CATI  CPAY1('rni  APTPO  ■’’Pinpr  I.OAnR,  run  21A  to  STATTT.  !',  .PALSP.) 
rAT.I.  rPAY  1 (•  IT  AT  SOTOtTPTnPC;  USE  * »1  SOURCE* !' , .FALSE.) 
r 
r 

WOPS  = VSY*^  * (1  ♦ (NPOP-1)*4) 

TALI  rHErPfNors,  .i’ptip. ) 
r 

C ...  T *I  'I' T 1 r ’■  T *■  TU’SI.'MS  f, PN*’’  PT.OCK  W T 1 ?'  TTS  "OIVTERS. 
r 

T-rx  (7,  1 ) ♦ 1)  ~ r.,i\ 

TM""  ( 7,  1 1 ♦ 1)  = Y"  1 S"'  1 

TMpM  (7,  17  ♦ 1)  =67 
TIP"  (7,13*1)  =66 

’^1PM(7,  14  *1)  = RRASP  - 1 

tmt-w  (7,  16*  1)  = ARAPf-  1 

TMP1  (7,  16*1)  = CR  ASE 

TMpm  (2,17*1)  = 66 

T1*M  (7,  1R  * 1)  =64 

C 
C 

r ...  lOAn  Tor^Lv  ano  '.iv^  stmulatop  contfci  to  uspr. 

CATT,  rpAYlCrri  apTP"  LOAHS,  RUN  2 lA  TO  START.!',  .FALSE.) 

CALL  CRAY  1 CIOAO  SOT’O  ; Tt  SI  V;  USE  *1?CnPCE*!',  .pALSE.) 
r 

MOPS  ^ MSYS  • ((HP0S-1)*2  * VPnS*3) 

CALI  rfIPrK(MOrs,  .TPUF.) 

C 

r 

STOP 
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c 

C ...  NOT  •'Nntfr.H  xptinny  pon  tI'K  PPOPLF^  ST7E. 

900  iuvnon  = (»1''wcT7.  - C9A‘^^)/4 
WPTTF  (fif^OOO)  HAKOPO 

9000  ’^OP"!A‘^  (• ')C»>Ay- 1 I’^'^OPY  -pno  9MAIT  POP  Till':  pnnPL’"1  9I7R.'/ 

♦ • TPE  1,AP''.P0T  Ppf'P'TCT  OF  N5Y£*NFr)5  >10'', T BE  < ' , T '>) 

OTOn 

r 

O ...  Tnn  lAMT  PAPAT.I.'"T.  OvBTpmc. 

9 10  WPTTE  (fi,9  J 10)  MOYS 
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Appendix  .7. 


Glossary  of  MTS  Terms 

For  those  readers  unfamilier  with  MTS,  this  glossary  defines 
the  MTS  terms  used  in  this  report. 

Attention  Interrupt  - 

The  attenti^  interrupt  is  a means  of  allowing  the  MTS  time- 
sharing user  ^ regain  control  over  an  errant  job.  The  user 
at  a terminal  i ay  signal  an  attention  by  hitting  the  terminal 
break  key  or  typing  a control-E.  If  the  program  being  run 
does  not  intercept  the  attention,  MTS  will  field  it  and  stop 
the  program. 

FDUB  - 

A file  or  device  usage  Jplock  (FDUB)  is  a data  structure  used 
by  MTS  to  control  I/O  to  files  and  devices.  A program  may 
request  MTS  to  create  a FDUB  for  a file  or  a device  that  is  to 
be  accessed.  Upon  creation  of  the  FDUB,. MTS  returns  to  the 
program  with  a pointer  to  the  FDUB.  The  program  may  then 
use  this  FDUB  pointer  as  an  argument  to  the  MTS  subroutines 
READ  and  WRITE  which  will  control  the  I/O  to  the  file  or  device. 

The  simulator's  command  stream  input  stack  is  simply  a 
stack  of  FDUB  pointers  to  all  the  files  currently  open.  Within 
each  FDUB,  MTS  remembers  the  line  number  of  the  next  record 
to  be  read. 

Keyboard  Attention  - 

See  Attention  Interrupt. 


MTS  - 

MTS,  the  Michigan  Terminal  System,  is  a terminal-oriented 
time-sharing  operating  system  that  offers  both  batch  and 
terminal  facilities.  Currently,  MTS  is  run  on  an  Amdahl 
470/V6  computer  at  the  University  of  Michigan. 
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*MSINK* 


*MSINK*  is  a pseudo-device  name  used  by  MTS  to  refer  to  the 
master  sink  (master  output)  device  for  the  job.  In  batch 
mode,  the  master  output  device  is  the  line  printer.  In  ter- 
minal mode,  the  master  output  device  is  the  terminal  printer. 

*MSOURCE*  - 

*MSOURCE*  is  a pseudo-device  name  used  by  MTS  to  refer  to  the 
master  source  (master  input)  device  for  the  job.  In  batch 
mode,  the  master  input  device  is  the  card  stream.  In  terminal 
mode,  the  master  input  device  is  the  terminal  keyboard. 

♦PRINT*  - 

♦PRINT*  is  the  pseudo-device  name  used  by  MTS  to  refer  to  the 
line  printer.  This  allows  a terminal  user  to  direct  output  to 
the  line  printer  by  specifying  *PRINT*  as  the  output  device. 

MTS  assigns  the  terminal  user  a receipt  number  used  to  pick 
up  the  output. 

♦SOURCE*  - 

♦SOURCE*  is  a psuedo-device  name  used  by  MTS  to  refer  to  the 
current  input  device.  MTS  reads  its  v'ommands  from  this  pseudo- 
device. When  a terminal  or  batch  job  begins,  *SOURCE*  is  the 
same  as  *MSOURCE*.  But,  unlike  *MSOURCE*,  whose  definition  is 
fixed,  *SOURCE*  can  refer  to  another  file  or  device.  The  de- 
finition of  *SOURCE*  is  under  user  control  via  the  MTS  command 
"$SOURCE  fdname",  where  fdname  is  the  file  of  device  name  which 
the  user  wishes  to  be  defined  as  *SOURCE*,  Therefore,  any  pro- 
gram, or  MTS,  which  reads  from  *SOURCE*  will  read  from  the 
file  or  device  specified  in  the  $SOURCE  command. 

♦SINK*  - 

♦SINK*  fills  the  same  role  for  the  current  output  device  that 
♦SOURCE*  does  for  the  current  input  device.  The  user  may  change 
♦SINK*  through  the  MTS  command  "$SINK  fdname".  When  a terminal 
or  batch  job  begins,  *SINK*  is  defined  as  *MSINK*. 
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Appendix  K. 
Load  Module  Formats 


1 . Absolute  Modules 

An  absolute  load  module  is  composed  of  a header  record  and  one 
or  more  parcel  records.  The  header  record  is  composed  of  three  free 
formatted  fields:  a starting  parcel  address,  a module  length  and  an 
optional  title  as  shown  below. 

p-addr  length  [title-string] 

Each  field  is  separated  by  blanks.  The  p-addr  field  contains  a modi- 
fied octal  (octal  word  address  followed  by  an  A,B,C  or  D parcel  code) 
parcel  address  where  the  first  parcel  is  stored.  The  length  field 
contains  the  octal  number  of  parcels  following  on  subsequent  parcel 
records.  The  title-string  field  contains  the  title  that  is  placed  in 
the  title  field  of  the  CPACT  report  (also  see  the  SET  TITLE  = title- 
string command) . The  title  field  begins  with  the  first  character  after 
the  blank  which  terminates  the  length  field  and  may  contain  embedded 
blanks.  Only  the  first  35  characters  are  retained.  The  LOAD  command 
will  echo  the  header  record  information  to  the  output  device. 

The  parcel  record  portion  of  the  absolute  load  module  contains 
one  or  more  free  formatted  records.  One  or  more  parcels  may  be  pro- 
vided on  each  record  in  octal  or  modified  octal  format  and  must  be 
separated  by  at  least  one  space.  A colon  may  be  placed  anywhere  on 
a parcel  record  which  will  stop  the  scan  of  the  record  at  that  point. 
This  allows  comments  to  be  placed  to  the  right  of  the  colon.  Blank 
records  in  the  parcel  record  portion  are  ignored.  The  number  of  par- 
cels in  the  parcel  record  portion  of  the  load  module  must  correspond 
to  the  octal  length  specified  on  the  header  record. 

All  records  in  an  absolute  load  module  must  be  less  than  81  char- 
acters. Since  the  load  modules  are  represented  in  character  format 
it  is  a simple  matter  to  patch  a module  with  the  editor.  If  you  add 
or  delete  parcels,  you  must  update  the  octal  length  appropriately. 


Example : 

21a  3 A TITLE 

: THIS  IS  AN  EXAMPLE  OF  A 3 PARCEL 

: ABSOLUTE  LOAD  MODULE 

022133  032211  : Al  33  ; A2  Al*Al 

004000  : EX  O 

$ENDFILE 

2 . ReJ.ocatable  Modules 

Relocatable  modules  consist  of  seven  types  of  binary  records.  An  IDEM 
record,  one  or  more  TXT  records,  zero  or  more  RLD,  EXT,  ENTR,  and 
SYM  records,  and  an  END  record. 

An  IDEN  record  identifies  the  name  of  the  module.  The  record 
consists  of  the  characters  IDEN,  followed  by  4 spaces,  followed  by 
the  8 character  name  of  the  module. 

A TXT  record  contains  the  actual  object  code  to  be  loaded.  It 
consists  of  the  letters  TXT,  followed  by  one  space,  followed  by  a 
four  byte  binary  address  of  this  portion  of  the  module  (relative  to 
the  top  of  the  module) , followed  by  a four  byte  binary  length.  The 
actual  text  to  bo  loaded  is  on  the  following  card. 

An  RLD  record  identifies  the  locations  in  the  module  which  must 
be  relocated.  It  consists  of  the  letters  RLD,  follov/ed  by  one  space, 
followed  by  one  or  more  8 byte  fields.  The  first  4 bytes  of  the  field 
contain  the  binary  address  (relative  to  the  top  of  the  module)  of  the 
text  to  be  relocated.  The  second  4 bytes  contain  a number  describing 
the  type  of  relocation  to  be  performed.  See  RLD  & EXT  types,  below. 

An  EXT  record  identifies  the  locations  in  the  module  which  refer 
to  external  locations.  It  consists  of  the  letters  EXT,  followed  by  5 
spaces,  followed  by  one  or  more  16  byte  fields.  The  first  8 bytes  of 
the  field  contains  the  8 character  name  of  the  external  location  re- 
ferenced. The  next  four  bytes  contain  the  binary  address  (relative  to 
the  top  of  the  module)  of  the  text  referencing  the  external.  The  last 
4 bytes  contain  a number  describing  the  type  of  reference.  See  RLD  & 
EXT  types,  below. 

An  ENTR  record  identifies  entry  points  in  the  module.  It  consists 
of  the  letters  ENTR,  collowed  by  4 spaces,  followed  by  one  or  more  12 
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byte  fields.  The  first  8 bytes  of  the  field  contain  the  name  of  the 
entry  point,  and  the  last  4 bytes  contain  the  address  (relative  to 
the  top  of  the  module)  of  the  entry  point. 

A SYM  record  contains  definitions  of  all  symbols  in  the  module 
which  may  be  referenced  by  the  simulator  command  language.  It  con- 
sists of  the  letters  SYM,  followed  by  5 spaces,  followed  by  one  or 
more  16  byte  fields.  The  first  8 bytes  contain  the  name  of  the  sym- 
bol. The  next  4 bytes  contain  the  value  of  the  symbol.  The  last  4 
bytes  contain  a number  identifying  the  type  of  the  symbol.  See  SYM 
types,  below. 

An  END  record  terminates  the  module.  It  consists  of  the  letters 
END,  followed  by  1 space.  If  a START  pseudo  op  was  contained  in  the 
program,  the  next  4 bytes  contain  an  address  (relative  to  the  top  of 
the  module)  of  the  starting  location. 

RLD  and  EXT  types:  1:  Two  parcel,  parcel  address 

2:  Two  parcel,  word  address 

3:  Four  parcel,  parcel  address 

4:  Four  parcel,  word  address 


SYM  types: 


1:  Parcel 

2:  Value 

3 : Word 
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